TY - JOUR
T1 - Temporal validation of 30-day mortality prediction models for transcatheter aortic valve implantation using statistical process control – An observational study in a national population
AU - Lopes, Ricardo R.
AU - Yordanov, Tsvetan T. R.
AU - Ravelli, Anita A. C. J.
AU - Houterman, Saskia
AU - Vis, Marije
AU - de Mol, Bas A. J. M.
AU - Marquering, Henk
AU - Abu-Hanna, Ameen
N1 - Funding Information: Mr Ricardo Ricci Lopes was supported by ITEA3 { 16017 }. Publisher Copyright: © 2023 The Authors
PY - 2023/6/1
Y1 - 2023/6/1
N2 - Background: Various mortality prediction models for Transcatheter Aortic Valve Implantation (TAVI) have been developed in the past years. The effect of time on the performance of such models, however, is unclear given the improvements in the procedure and changes in patient selection, potentially jeopardizing the usefulness of the prediction models in clinical practice. We aim to explore how time affects the performance and stability of different types of prediction models of 30-day mortality after TAVI. Methods: We developed both parametric (Logistic Regression) and non-parametric (XGBoost) models to predict 30-day mortality after TAVI using data from the Netherlands Heart Registration. The models were trained with data from 2013 to the beginning of 2016 and pre-control charts from Statistical Process Control were used to analyse how time affects the models' performance on independent data from the mid of 2016 to the end of 2019. The area under the Receiver Operating Characteristics curve (AUC) was used to evaluate the models in terms of discrimination and the Brier Score (BS), which is related to calibration, in terms of accuracy of the predicted probabilities. To understand the extent to which refitting the models contribute to the models’ stability, we also allowed the models to be updated over time. Results: We included data from 11,291 consecutive TAVI patients from hospitals in the Netherlands. The parametric model without re-training had a median AUC of 0.64 (IQR 0.54–0.73) and BS of 0.028 (IQR 0.021–0.035). For the non-parametric model, the median AUC was 0.63 (IQR 0.48–0.68) and BS was 0.027 (IQR 0.021–0.036). Over time, the developed parametric model was stable in terms of AUC and unstable in terms of BS. The non-parametric model was considered unstable in both AUC and BS. Repeated model refitting resulted in stable models in terms of AUC and decreased the variability of BS, although BS was still unstable. The refitted parametric model had a median AUC of 0.66 (IQR 0.57–0.73) and BS of 0.027 (IQR 0.020–0.035) while the non-parametric model had a median AUC of 0.66 (IQR 0.57–0.74) and BS of 0.027 (IQR 0.023–0.035). Conclusions: The temporal validation of the TAVI 30-day mortality prediction models showed that the models refitted over time are more stable and accurate when compared to the frozen models. This highlights the importance of repeatedly refitted models over time to improve or at least maintain their performance stability. The non-parametric approach did not show improvement over the parametric approach.
AB - Background: Various mortality prediction models for Transcatheter Aortic Valve Implantation (TAVI) have been developed in the past years. The effect of time on the performance of such models, however, is unclear given the improvements in the procedure and changes in patient selection, potentially jeopardizing the usefulness of the prediction models in clinical practice. We aim to explore how time affects the performance and stability of different types of prediction models of 30-day mortality after TAVI. Methods: We developed both parametric (Logistic Regression) and non-parametric (XGBoost) models to predict 30-day mortality after TAVI using data from the Netherlands Heart Registration. The models were trained with data from 2013 to the beginning of 2016 and pre-control charts from Statistical Process Control were used to analyse how time affects the models' performance on independent data from the mid of 2016 to the end of 2019. The area under the Receiver Operating Characteristics curve (AUC) was used to evaluate the models in terms of discrimination and the Brier Score (BS), which is related to calibration, in terms of accuracy of the predicted probabilities. To understand the extent to which refitting the models contribute to the models’ stability, we also allowed the models to be updated over time. Results: We included data from 11,291 consecutive TAVI patients from hospitals in the Netherlands. The parametric model without re-training had a median AUC of 0.64 (IQR 0.54–0.73) and BS of 0.028 (IQR 0.021–0.035). For the non-parametric model, the median AUC was 0.63 (IQR 0.48–0.68) and BS was 0.027 (IQR 0.021–0.036). Over time, the developed parametric model was stable in terms of AUC and unstable in terms of BS. The non-parametric model was considered unstable in both AUC and BS. Repeated model refitting resulted in stable models in terms of AUC and decreased the variability of BS, although BS was still unstable. The refitted parametric model had a median AUC of 0.66 (IQR 0.57–0.73) and BS of 0.027 (IQR 0.020–0.035) while the non-parametric model had a median AUC of 0.66 (IQR 0.57–0.74) and BS of 0.027 (IQR 0.023–0.035). Conclusions: The temporal validation of the TAVI 30-day mortality prediction models showed that the models refitted over time are more stable and accurate when compared to the frozen models. This highlights the importance of repeatedly refitted models over time to improve or at least maintain their performance stability. The non-parametric approach did not show improvement over the parametric approach.
KW - Aortic stenosis
KW - Machine learning
KW - Prediction models
KW - Statistical process control
KW - Temporal validation
KW - Transcatheter aortic valve implantation
UR - http://www.scopus.com/inward/record.url?scp=85162098426&partnerID=8YFLogxK
U2 - https://doi.org/10.1016/j.heliyon.2023.e17139
DO - https://doi.org/10.1016/j.heliyon.2023.e17139
M3 - Article
C2 - 37484279
SN - 2405-8440
VL - 9
SP - e17139
JO - Heliyon
JF - Heliyon
IS - 6
M1 - e17139
ER -