TY - JOUR
T1 - Pitfalls of linear regression for estimating slopes over time and how to avoid them by using linear mixed-effects models
AU - Janmaat, Cynthia J.
AU - van Diepen, Merel
AU - Tsonaka, Roula
AU - Jager, Kitty J.
AU - Zoccali, Carmine
AU - Dekker, Friedo W.
PY - 2019
Y1 - 2019
N2 - Clinical epidemiological studies often focus on investigating the underlying causes of disease. For instance, a nephrologist may be interested in the association between blood pressure and the development of chronic kidney disease (CKD). However, instead of focusing on the mere occurrence of CKD, the decline of kidney function over time might be the outcome of interest. For examining this kidney function trajectory, patients are typically followed over time with their kidney function estimated at several time points. During follow-up, some patients may drop out earlier than others and for different reasons. Furthermore, some patients may have greater kidney function at study entry or faster kidney function decline than others. Also, a substantial heterogeneity may exist in the number of kidney function estimates available for each patient. This heterogeneity with respect to kidney function, dropout and number of kidney function estimates is important to take into account when estimating kidney function trajectories. In general, two methods are used in the literature to estimate kidney function trajectories over time: linear regression to estimate individual slopes and the linear mixed-effects model (LMM), i.e. repeated measures analysis. Importantly, the linear regression method does not properly take into account the above-mentioned heterogeneity, whereas the LMM is able to retain all information and variability in the data. However, the underlying concepts, use and interpretation of LMMs are not always straightforward. Therefore we illustrate this using a clinical example and offer a framework of how to model and interpret the LMM.
AB - Clinical epidemiological studies often focus on investigating the underlying causes of disease. For instance, a nephrologist may be interested in the association between blood pressure and the development of chronic kidney disease (CKD). However, instead of focusing on the mere occurrence of CKD, the decline of kidney function over time might be the outcome of interest. For examining this kidney function trajectory, patients are typically followed over time with their kidney function estimated at several time points. During follow-up, some patients may drop out earlier than others and for different reasons. Furthermore, some patients may have greater kidney function at study entry or faster kidney function decline than others. Also, a substantial heterogeneity may exist in the number of kidney function estimates available for each patient. This heterogeneity with respect to kidney function, dropout and number of kidney function estimates is important to take into account when estimating kidney function trajectories. In general, two methods are used in the literature to estimate kidney function trajectories over time: linear regression to estimate individual slopes and the linear mixed-effects model (LMM), i.e. repeated measures analysis. Importantly, the linear regression method does not properly take into account the above-mentioned heterogeneity, whereas the LMM is able to retain all information and variability in the data. However, the underlying concepts, use and interpretation of LMMs are not always straightforward. Therefore we illustrate this using a clinical example and offer a framework of how to model and interpret the LMM.
UR - https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85064523856&origin=inward
UR - https://www.ncbi.nlm.nih.gov/pubmed/29796633
U2 - https://doi.org/10.1093/ndt/gfy128
DO - https://doi.org/10.1093/ndt/gfy128
M3 - Article
C2 - 29796633
SN - 0931-0509
VL - 34
SP - 561
EP - 566
JO - Nephrology, dialysis, transplantation
JF - Nephrology, dialysis, transplantation
IS - 4
ER -