TY - JOUR
T1 - The added value of temporal data and the best way to handle it
T2 - A use-case for atrial fibrillation using general practitioner data
AU - Bennis, Frank C.
AU - Aussems, Claire
AU - Korevaar, Joke C.
AU - Hoogendoorn, Mark
N1 - Publisher Copyright: © 2024 The Author(s)
PY - 2024/3/1
Y1 - 2024/3/1
N2 - Introduction: Temporal data poses numerous challenges for deep learning, such as irregular sampling. New algorithms are being developed that handle these temporal challenges better. However, it is unclear how performance varies from classical non-temporal models to newly developed algorithms. Therefore, this study compares different non-temporal and temporal algorithms for a relevant use case: the prediction of atrial fibrillation (AF) using general practitioner (GP) data. Methods: Three datasets with a 365-day observation window and prediction windows of 14, 180 and 360 days were used. Data consisted of medication, lab, symptom, and chronic disease codings registered by the GP. The benchmark discarded temporality and used logistic regression, XGBoost models and neural networks on the presence of codings over the whole year. For the pattern data, common patterns of GP codings were extracted and tested using the same algorithms. LSTM and CKConv models were trained as models incorporating temporality. Results: Algorithms that incorporated temporality (LSTM and CKConv; max AUC 0.734 at the 360-day prediction window) outperformed both benchmark and pattern algorithms (max AUC 0.723), with a significant improvement at the 360-day prediction window (p = 0.04). The difference between the benchmark and the LSTM or CKConv algorithms decreased with smaller prediction windows, indicating that temporality matters more for longer prediction windows. The CKConv and LSTM algorithms performed similarly, possibly due to limited sequence length. Conclusion: Temporal models outperformed non-temporal models for the prediction of AF. Among temporal models, CKConv is a promising algorithm for GP data, as it can handle irregularly sampled data.
AB - Introduction: Temporal data poses numerous challenges for deep learning, such as irregular sampling. New algorithms are being developed that handle these temporal challenges better. However, it is unclear how performance varies from classical non-temporal models to newly developed algorithms. Therefore, this study compares different non-temporal and temporal algorithms for a relevant use case: the prediction of atrial fibrillation (AF) using general practitioner (GP) data. Methods: Three datasets with a 365-day observation window and prediction windows of 14, 180 and 360 days were used. Data consisted of medication, lab, symptom, and chronic disease codings registered by the GP. The benchmark discarded temporality and used logistic regression, XGBoost models and neural networks on the presence of codings over the whole year. For the pattern data, common patterns of GP codings were extracted and tested using the same algorithms. LSTM and CKConv models were trained as models incorporating temporality. Results: Algorithms that incorporated temporality (LSTM and CKConv; max AUC 0.734 at the 360-day prediction window) outperformed both benchmark and pattern algorithms (max AUC 0.723), with a significant improvement at the 360-day prediction window (p = 0.04). The difference between the benchmark and the LSTM or CKConv algorithms decreased with smaller prediction windows, indicating that temporality matters more for longer prediction windows. The CKConv and LSTM algorithms performed similarly, possibly due to limited sequence length. Conclusion: Temporal models outperformed non-temporal models for the prediction of AF. Among temporal models, CKConv is a promising algorithm for GP data, as it can handle irregularly sampled data.
KW - Atrial fibrillation
KW - CKConv
KW - Clinical prediction
KW - Convolutional neural networks
KW - General practitioner
KW - Temporal
UR - http://www.scopus.com/inward/record.url?scp=85186436054&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85186436054&partnerID=8YFLogxK
U2 - 10.1016/j.compbiomed.2024.108097
DO - 10.1016/j.compbiomed.2024.108097
M3 - Article
C2 - 38412689
SN - 0010-4825
VL - 171
SP - 1
EP - 11
JO - Computers in Biology and Medicine
JF - Computers in Biology and Medicine
M1 - 108097
ER -