Can we reliably automate clinical prognostic modelling?: A retrospective cohort study for ICU triage prediction of in-hospital mortality of COVID-19 patients in the Netherlands

Dutch COVID-19 Research Consortium

doi:https://doi.org/10.1016/j.ijmedinf.2022.104688

Can we reliably automate clinical prognostic modelling? A retrospective cohort study for ICU triage prediction of in-hospital mortality of COVID-19 patients in the Netherlands

Dutch COVID-19 Research Consortium

Research output: Contribution to journal › Article › Academic › peer-review

4 Citations (Scopus)

Abstract

Background: Building Machine Learning (ML) models in healthcare may suffer from time-consuming and potentially biased pre-selection of predictors by hand that can result in limited or trivial selection of suitable models. We aimed to assess the predictive performance of automating the process of building ML models (AutoML) in-hospital mortality prediction modelling of triage COVID-19 patients at ICU admission versus expert-based predictor pre-selection followed by logistic regression. Methods: We conducted an observational study of all COVID-19 patients admitted to Dutch ICUs between February and July 2020. We included 2,690 COVID-19 patients from 70 ICUs participating in the Dutch National Intensive Care Evaluation (NICE) registry. The main outcome measure was in-hospital mortality. We asessed model performance (at admission and after 24h, respectively) of AutoML compared to the more traditional approach of predictor pre-selection and logistic regression. Findings: Predictive performance of the autoML models with variables available at admission shows fair discrimination (average AUROC = 0·75-0·76 (sdev = 0·03), PPV = 0·70-0·76 (sdev = 0·1) at cut-off = 0·3 (the observed mortality rate), and good calibration. This performance is on par with a logistic regression model with selection of patient variables by three experts (average AUROC = 0·78 (sdev = 0·03) and PPV = 0·79 (sdev = 0·2)). Extending the models with variables that are available at 24h after admission resulted in models with higher predictive performance (average AUROC = 0·77-0·79 (sdev = 0·03) and PPV = 0·79-0·80 (sdev = 0·10-0·17)). Conclusions: AutoML delivers prediction models with fair discriminatory performance, and good calibration and accuracy, which is as good as regression models with expert-based predictor pre-selection. In the context of the restricted availability of data in an ICU quality registry, extending the models with variables that are available at 24h after admission showed small (but significantly) performance increase.

Original language	English
Article number	104688
Journal	International Journal of Medical Informatics
Volume	160
DOIs	https://doi.org/10.1016/j.ijmedinf.2022.104688 https://doi.org/10.1016/j.ijmedinf.2022.104688
Publication status	Published - 1 Apr 2022

Access to Document

Cite this

Dutch COVID-19 Research Consortium (2022). Can we reliably automate clinical prognostic modelling? A retrospective cohort study for ICU triage prediction of in-hospital mortality of COVID-19 patients in the Netherlands. International Journal of Medical Informatics, 160, Article 104688. https://doi.org/10.1016/j.ijmedinf.2022.104688, https://doi.org/10.1016/j.ijmedinf.2022.104688

@article{d4320e07453c4cbeb47f357ac33bd84f,

title = "Can we reliably automate clinical prognostic modelling?: A retrospective cohort study for ICU triage prediction of in-hospital mortality of COVID-19 patients in the Netherlands",

abstract = "Background: Building Machine Learning (ML) models in healthcare may suffer from time-consuming and potentially biased pre-selection of predictors by hand that can result in limited or trivial selection of suitable models. We aimed to assess the predictive performance of automating the process of building ML models (AutoML) in-hospital mortality prediction modelling of triage COVID-19 patients at ICU admission versus expert-based predictor pre-selection followed by logistic regression. Methods: We conducted an observational study of all COVID-19 patients admitted to Dutch ICUs between February and July 2020. We included 2,690 COVID-19 patients from 70 ICUs participating in the Dutch National Intensive Care Evaluation (NICE) registry. The main outcome measure was in-hospital mortality. We asessed model performance (at admission and after 24h, respectively) of AutoML compared to the more traditional approach of predictor pre-selection and logistic regression. Findings: Predictive performance of the autoML models with variables available at admission shows fair discrimination (average AUROC = 0·75-0·76 (sdev = 0·03), PPV = 0·70-0·76 (sdev = 0·1) at cut-off = 0·3 (the observed mortality rate), and good calibration. This performance is on par with a logistic regression model with selection of patient variables by three experts (average AUROC = 0·78 (sdev = 0·03) and PPV = 0·79 (sdev = 0·2)). Extending the models with variables that are available at 24h after admission resulted in models with higher predictive performance (average AUROC = 0·77-0·79 (sdev = 0·03) and PPV = 0·79-0·80 (sdev = 0·10-0·17)). Conclusions: AutoML delivers prediction models with fair discriminatory performance, and good calibration and accuracy, which is as good as regression models with expert-based predictor pre-selection. In the context of the restricted availability of data in an ICU quality registry, extending the models with variables that are available at 24h after admission showed small (but significantly) performance increase.",

author = "{Dutch COVID-19 Research Consortium} and I Vagliano and S Brinkman and A Abu-Hanna and Arbous, {M S} and Dongelmans, {D A} and Elbers, {P W G} and {de Lange}, {D W} and {van der Schaar}, M and {de Keizer}, {N F} and Schut, {M C}",

note = "Funding Information: This research was funded by The Netherlands Organisation for Health Research and Development (ZonMw) COVID-19 Programme in the bottom-up focus area 1 “Predictive diagnostics and treatment” for theme 3 “Risk analysis and prognostics” (project number 10430 01 201 0011: IRIS). The funder had no role in the design of the study or writing the manuscript. Publisher Copyright: {\textcopyright} 2022 The Author(s)",

year = "2022",

month = apr,

day = "1",

doi = "https://doi.org/10.1016/j.ijmedinf.2022.104688",

language = "English",

volume = "160",

journal = "International Journal of Medical Informatics",

issn = "1386-5056",

publisher = "Elsevier Ireland Ltd",

}

Dutch COVID-19 Research Consortium 2022, 'Can we reliably automate clinical prognostic modelling? A retrospective cohort study for ICU triage prediction of in-hospital mortality of COVID-19 patients in the Netherlands', International Journal of Medical Informatics, vol. 160, 104688. https://doi.org/10.1016/j.ijmedinf.2022.104688, https://doi.org/10.1016/j.ijmedinf.2022.104688

Can we reliably automate clinical prognostic modelling? A retrospective cohort study for ICU triage prediction of in-hospital mortality of COVID-19 patients in the Netherlands. / Dutch COVID-19 Research Consortium.
In: International Journal of Medical Informatics, Vol. 160, 104688, 01.04.2022.

Research output: Contribution to journal › Article › Academic › peer-review

TY - JOUR

T1 - Can we reliably automate clinical prognostic modelling?

T2 - A retrospective cohort study for ICU triage prediction of in-hospital mortality of COVID-19 patients in the Netherlands

AU - Dutch COVID-19 Research Consortium

AU - Vagliano, I

AU - Brinkman, S

AU - Abu-Hanna, A

AU - Arbous, M S

AU - Dongelmans, D A

AU - Elbers, P W G

AU - de Lange, D W

AU - van der Schaar, M

AU - de Keizer, N F

AU - Schut, M C

N1 - Funding Information: This research was funded by The Netherlands Organisation for Health Research and Development (ZonMw) COVID-19 Programme in the bottom-up focus area 1 “Predictive diagnostics and treatment” for theme 3 “Risk analysis and prognostics” (project number 10430 01 201 0011: IRIS). The funder had no role in the design of the study or writing the manuscript. Publisher Copyright: © 2022 The Author(s)

PY - 2022/4/1

Y1 - 2022/4/1

N2 - Background: Building Machine Learning (ML) models in healthcare may suffer from time-consuming and potentially biased pre-selection of predictors by hand that can result in limited or trivial selection of suitable models. We aimed to assess the predictive performance of automating the process of building ML models (AutoML) in-hospital mortality prediction modelling of triage COVID-19 patients at ICU admission versus expert-based predictor pre-selection followed by logistic regression. Methods: We conducted an observational study of all COVID-19 patients admitted to Dutch ICUs between February and July 2020. We included 2,690 COVID-19 patients from 70 ICUs participating in the Dutch National Intensive Care Evaluation (NICE) registry. The main outcome measure was in-hospital mortality. We asessed model performance (at admission and after 24h, respectively) of AutoML compared to the more traditional approach of predictor pre-selection and logistic regression. Findings: Predictive performance of the autoML models with variables available at admission shows fair discrimination (average AUROC = 0·75-0·76 (sdev = 0·03), PPV = 0·70-0·76 (sdev = 0·1) at cut-off = 0·3 (the observed mortality rate), and good calibration. This performance is on par with a logistic regression model with selection of patient variables by three experts (average AUROC = 0·78 (sdev = 0·03) and PPV = 0·79 (sdev = 0·2)). Extending the models with variables that are available at 24h after admission resulted in models with higher predictive performance (average AUROC = 0·77-0·79 (sdev = 0·03) and PPV = 0·79-0·80 (sdev = 0·10-0·17)). Conclusions: AutoML delivers prediction models with fair discriminatory performance, and good calibration and accuracy, which is as good as regression models with expert-based predictor pre-selection. In the context of the restricted availability of data in an ICU quality registry, extending the models with variables that are available at 24h after admission showed small (but significantly) performance increase.

AB - Background: Building Machine Learning (ML) models in healthcare may suffer from time-consuming and potentially biased pre-selection of predictors by hand that can result in limited or trivial selection of suitable models. We aimed to assess the predictive performance of automating the process of building ML models (AutoML) in-hospital mortality prediction modelling of triage COVID-19 patients at ICU admission versus expert-based predictor pre-selection followed by logistic regression. Methods: We conducted an observational study of all COVID-19 patients admitted to Dutch ICUs between February and July 2020. We included 2,690 COVID-19 patients from 70 ICUs participating in the Dutch National Intensive Care Evaluation (NICE) registry. The main outcome measure was in-hospital mortality. We asessed model performance (at admission and after 24h, respectively) of AutoML compared to the more traditional approach of predictor pre-selection and logistic regression. Findings: Predictive performance of the autoML models with variables available at admission shows fair discrimination (average AUROC = 0·75-0·76 (sdev = 0·03), PPV = 0·70-0·76 (sdev = 0·1) at cut-off = 0·3 (the observed mortality rate), and good calibration. This performance is on par with a logistic regression model with selection of patient variables by three experts (average AUROC = 0·78 (sdev = 0·03) and PPV = 0·79 (sdev = 0·2)). Extending the models with variables that are available at 24h after admission resulted in models with higher predictive performance (average AUROC = 0·77-0·79 (sdev = 0·03) and PPV = 0·79-0·80 (sdev = 0·10-0·17)). Conclusions: AutoML delivers prediction models with fair discriminatory performance, and good calibration and accuracy, which is as good as regression models with expert-based predictor pre-selection. In the context of the restricted availability of data in an ICU quality registry, extending the models with variables that are available at 24h after admission showed small (but significantly) performance increase.

UR - http://www.scopus.com/inward/record.url?scp=85123782596&partnerID=8YFLogxK

UR - https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85123782596&origin=inward

UR - https://www.ncbi.nlm.nih.gov/pubmed/35114522

U2 - https://doi.org/10.1016/j.ijmedinf.2022.104688

DO - https://doi.org/10.1016/j.ijmedinf.2022.104688

M3 - Article

C2 - 35114522

SN - 1386-5056

VL - 160

JO - International Journal of Medical Informatics

JF - International Journal of Medical Informatics

M1 - 104688

ER -

Dutch COVID-19 Research Consortium. Can we reliably automate clinical prognostic modelling? A retrospective cohort study for ICU triage prediction of in-hospital mortality of COVID-19 patients in the Netherlands. International Journal of Medical Informatics. 2022 Apr 1;160:104688. doi: https://doi.org/10.1016/j.ijmedinf.2022.104688, https://doi.org/10.1016/j.ijmedinf.2022.104688

Can we reliably automate clinical prognostic modelling? A retrospective cohort study for ICU triage prediction of in-hospital mortality of COVID-19 patients in the Netherlands

Abstract

Access to Document

Other files and links

Cite this