TY - JOUR
T1 - An integrated approach to geographic validation helped scrutinize prediction model performance and its variability
AU - Yordanov, Tsvetan R.
AU - Lopes, Ricardo R.
AU - Ravelli, Anita C. J.
AU - Vis, Marije
AU - NHR THI Registration Committee
AU - Houterman, Saskia
AU - Marquering, Henk
AU - Abu-Hanna, Ameen
N1 - Funding Information: Declaration of interests: The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper. Publisher Copyright: © 2023 The Author(s)
PY - 2023/5/1
Y1 - 2023/5/1
N2 - Objectives: To illustrate in-depth validation of prediction models developed on multicenter data. Methods: For each hospital in a multicenter registry, we evaluated predictive performance of a 30-day mortality prediction model for transcatheter aortic valve implantation (TAVI) using the Netherlands heart registration (NHR) dataset. We measured discrimination and calibration per hospital in a leave-center-out analysis (LCOA). Meta-analysis was used to calculate I2 values per performance metric from the LCOA and to compute mean and confidence interval (CI) estimates. Case mix differences between studies were inspected using the framework of Debray et al. for understanding external validation. We also aimed to discover subgroups (SGs) with high model prediction error (PE) and their distribution over the centers. Results: We studied 16 hospitals with 11,599 TAVI patients with an early mortality of 3.7%. The models’ area under the curve (AUCs) had a wide range between hospitals from 0.59 to 0.79, and miscalibration occurred in seven hospitals. Mean AUC from meta-analysis was 0.68 (95% CI 0.65-0.70). I2 values were 0%, 74%, and 0% for AUC, calibration intercept and slope, respectively. Between-hospital case-mix differences were substantial, and model transportability was low. One SG was discovered with marked global PE and was associated with poor performance on validation centers. Conclusion: The illustrated combination of approaches provides useful insights to inspect multicenter-based prediction models, and it exposes their limitations in transportability and performance variability when applied to different populations.
AB - Objectives: To illustrate in-depth validation of prediction models developed on multicenter data. Methods: For each hospital in a multicenter registry, we evaluated predictive performance of a 30-day mortality prediction model for transcatheter aortic valve implantation (TAVI) using the Netherlands heart registration (NHR) dataset. We measured discrimination and calibration per hospital in a leave-center-out analysis (LCOA). Meta-analysis was used to calculate I2 values per performance metric from the LCOA and to compute mean and confidence interval (CI) estimates. Case mix differences between studies were inspected using the framework of Debray et al. for understanding external validation. We also aimed to discover subgroups (SGs) with high model prediction error (PE) and their distribution over the centers. Results: We studied 16 hospitals with 11,599 TAVI patients with an early mortality of 3.7%. The models’ area under the curve (AUCs) had a wide range between hospitals from 0.59 to 0.79, and miscalibration occurred in seven hospitals. Mean AUC from meta-analysis was 0.68 (95% CI 0.65-0.70). I2 values were 0%, 74%, and 0% for AUC, calibration intercept and slope, respectively. Between-hospital case-mix differences were substantial, and model transportability was low. One SG was discovered with marked global PE and was associated with poor performance on validation centers. Conclusion: The illustrated combination of approaches provides useful insights to inspect multicenter-based prediction models, and it exposes their limitations in transportability and performance variability when applied to different populations.
KW - Calibration
KW - Discrimination
KW - Heterogeneity
KW - Multicenter
KW - Prediction models
KW - Subgroup discovery
UR - https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85151006326&origin=inward
UR - https://www.ncbi.nlm.nih.gov/pubmed/36822443
UR - http://www.scopus.com/inward/record.url?scp=85151006326&partnerID=8YFLogxK
U2 - https://doi.org/10.1016/j.jclinepi.2023.02.021
DO - https://doi.org/10.1016/j.jclinepi.2023.02.021
M3 - Article
C2 - 36822443
SN - 0895-4356
VL - 157
SP - 13
EP - 21
JO - Journal of Clinical Epidemiology
JF - Journal of Clinical Epidemiology
ER -