TY - CHAP
T1 - Dealing with missing data, small sample sizes, and heterogeneity in machine learning studies of brain disorders
AU - Thomas, Rajat M.
AU - Bruin, Willem
AU - Zhutovsky, Paul
AU - van Wingen, Guido
PY - 2019/1/1
Y1 - 2019/1/1
N2 - In this chapter we explore three of the most common challenges in the application of machine learning techniques in brain disorders research: missing data, small sample sizes, and heterogeneity. After defining these challenges, we present a simple algorithm to generate data that are similar to a “real” dataset using pairwise correlations. This algorithm enables the reader to test the various strategies that are discussed later in the chapter. We then discuss a range of strategies that are currently available to mitigate the impact of missing data, small sample sizes, and heterogeneity on the results. As part of this discussion, we cover both classical strategies and state-of-the-art approaches based on neural networks. We conclude by providing a summary of key recommendations.
AB - In this chapter we explore three of the most common challenges in the application of machine learning techniques in brain disorders research: missing data, small sample sizes, and heterogeneity. After defining these challenges, we present a simple algorithm to generate data that are similar to a “real” dataset using pairwise correlations. This algorithm enables the reader to test the various strategies that are discussed later in the chapter. We then discuss a range of strategies that are currently available to mitigate the impact of missing data, small sample sizes, and heterogeneity on the results. As part of this discussion, we cover both classical strategies and state-of-the-art approaches based on neural networks. We conclude by providing a summary of key recommendations.
UR - https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85079503581&origin=inward
U2 - https://doi.org/10.1016/B978-0-12-815739-8.00014-6
DO - https://doi.org/10.1016/B978-0-12-815739-8.00014-6
M3 - Chapter
T3 - Machine Learning: Methods and Applications to Brain Disorders
SP - 249
EP - 266
BT - Machine Learning: Methods and Applications to Brain Disorders
PB - Elsevier
ER -