Prognostic models of in-hospital mortality of intensive care patients using neural representation of unstructured text: a systematic review and critical appraisal

I. Vagliano; N. Dormosh; M. Rios; T. T. Luik; T. M. Buonocore; P. W. G. Elbers; D. A. Dongelmans; M. C. Schut; A. Abu-Hanna

doi:https://doi.org/10.1016/j.jbi.2023.104504

Prognostic models of in-hospital mortality of intensive care patients using neural representation of unstructured text: a systematic review and critical appraisal

I. Vagliano, N. Dormosh, M. Rios, T. T. Luik, T. M. Buonocore, P. W. G. Elbers, D. A. Dongelmans, M. C. Schut, A. Abu-Hanna

Research output: Contribution to journal › Review article › Academic › peer-review

Abstract

OBJECTIVE: To review and critically appraise published and preprint reports of prognostic models of in-hospital mortality of patients in the intensive-care unit (ICU) based on neural representations (embeddings) of clinical notes.

METHODS: PubMed and arXiv were searched up to August 1, 2022. At least two reviewers independently selected the studies that developed a prognostic model of in-hospital mortality of intensive-care patients using free-text represented as embeddings and extracted data using the CHARMS checklist. Risk of bias was assessed using PROBAST. Reporting on the model was assessed with the TRIPOD guideline. To assess the machine learning components that were used in the models, we present a new descriptive framework based on different techniques to represent text and provide predictions from text. The study protocol was registered in the PROSPERO database (CRD42022354602).

RESULTS: Eighteen studies out of 2,825 were included. All studies used the publicly-available MIMIC dataset. Context-independent word embeddings are widely used. Model discrimination was provided by all studies (AUROC 0.75-0.96), but measures of calibration were scarce. Seven studies used both structural clinical variables and notes. Model discrimination improved when adding clinical notes to variables. None of the models was externally validated and often a simple train/test split was used for internal validation. Our critical appraisal demonstrated a high risk of bias in all studies and concerns regarding their applicability in clinical practice.

CONCLUSION: All studies used a neural architecture for prediction and were based on one publicly available dataset. Clinical notes were reported to improve predictive performance when used in addition to only clinical variables. Most studies had methodological, reporting, and applicability issues. We recommend reporting both model discrimination and calibration, using additional data sources, and using more robust evaluation strategies, including prospective and external validation. Finally, sharing data and code is encouraged to improve study reproducibility.

Original language	English
Article number	104504
Pages (from-to)	104504
Journal	Journal of biomedical informatics
Volume	146
Early online date	22 Sept 2023
DOIs	https://doi.org/10.1016/j.jbi.2023.104504
Publication status	Published - 1 Oct 2023

Keywords

Intensive care
Machine learning
Mortality
Natural language processing
Prognostic models
Systematic review

Access to Document

https://doi.org/10.1016/j.jbi.2023.104504

Cite this

Vagliano, I., Dormosh, N., Rios, M., Luik, T. T., Buonocore, T. M., Elbers, P. W. G., Dongelmans, D. A., Schut, M. C., & Abu-Hanna, A. (2023). Prognostic models of in-hospital mortality of intensive care patients using neural representation of unstructured text: a systematic review and critical appraisal. Journal of biomedical informatics, 146, 104504. Article 104504. https://doi.org/10.1016/j.jbi.2023.104504

@article{a901434094e941afa8683115da228ec9,

title = "Prognostic models of in-hospital mortality of intensive care patients using neural representation of unstructured text: a systematic review and critical appraisal",

abstract = "OBJECTIVE: To review and critically appraise published and preprint reports of prognostic models of in-hospital mortality of patients in the intensive-care unit (ICU) based on neural representations (embeddings) of clinical notes.METHODS: PubMed and arXiv were searched up to August 1, 2022. At least two reviewers independently selected the studies that developed a prognostic model of in-hospital mortality of intensive-care patients using free-text represented as embeddings and extracted data using the CHARMS checklist. Risk of bias was assessed using PROBAST. Reporting on the model was assessed with the TRIPOD guideline. To assess the machine learning components that were used in the models, we present a new descriptive framework based on different techniques to represent text and provide predictions from text. The study protocol was registered in the PROSPERO database (CRD42022354602).RESULTS: Eighteen studies out of 2,825 were included. All studies used the publicly-available MIMIC dataset. Context-independent word embeddings are widely used. Model discrimination was provided by all studies (AUROC 0.75-0.96), but measures of calibration were scarce. Seven studies used both structural clinical variables and notes. Model discrimination improved when adding clinical notes to variables. None of the models was externally validated and often a simple train/test split was used for internal validation. Our critical appraisal demonstrated a high risk of bias in all studies and concerns regarding their applicability in clinical practice.CONCLUSION: All studies used a neural architecture for prediction and were based on one publicly available dataset. Clinical notes were reported to improve predictive performance when used in addition to only clinical variables. Most studies had methodological, reporting, and applicability issues. We recommend reporting both model discrimination and calibration, using additional data sources, and using more robust evaluation strategies, including prospective and external validation. Finally, sharing data and code is encouraged to improve study reproducibility.",

keywords = "Intensive care, Machine learning, Mortality, Natural language processing, Prognostic models, Systematic review",

author = "I. Vagliano and N. Dormosh and M. Rios and Luik, {T. T.} and Buonocore, {T. M.} and Elbers, {P. W. G.} and Dongelmans, {D. A.} and Schut, {M. C.} and A. Abu-Hanna",

note = "Publisher Copyright: {\textcopyright} 2023 The Authors",

year = "2023",

month = oct,

day = "1",

doi = "https://doi.org/10.1016/j.jbi.2023.104504",

language = "English",

volume = "146",

pages = "104504",

journal = "Journal of biomedical informatics",

issn = "1532-0464",

publisher = "Academic Press Inc.",

}

Vagliano, I , Dormosh, N, Rios, M, Luik, TT, Buonocore, TM, Elbers, PWG , Dongelmans, DA , Schut, MC & Abu-Hanna, A 2023, 'Prognostic models of in-hospital mortality of intensive care patients using neural representation of unstructured text: a systematic review and critical appraisal', Journal of biomedical informatics, vol. 146, 104504, pp. 104504. https://doi.org/10.1016/j.jbi.2023.104504

Prognostic models of in-hospital mortality of intensive care patients using neural representation of unstructured text: a systematic review and critical appraisal. / Vagliano, I.; Dormosh, N.; Rios, M. et al.
In: Journal of biomedical informatics, Vol. 146, 104504, 01.10.2023, p. 104504.

Research output: Contribution to journal › Review article › Academic › peer-review

TY - JOUR

T1 - Prognostic models of in-hospital mortality of intensive care patients using neural representation of unstructured text

T2 - a systematic review and critical appraisal

AU - Vagliano, I.

AU - Dormosh, N.

AU - Rios, M.

AU - Luik, T. T.

AU - Buonocore, T. M.

AU - Elbers, P. W. G.

AU - Dongelmans, D. A.

AU - Schut, M. C.

AU - Abu-Hanna, A.

PY - 2023/10/1

Y1 - 2023/10/1

N2 - OBJECTIVE: To review and critically appraise published and preprint reports of prognostic models of in-hospital mortality of patients in the intensive-care unit (ICU) based on neural representations (embeddings) of clinical notes.METHODS: PubMed and arXiv were searched up to August 1, 2022. At least two reviewers independently selected the studies that developed a prognostic model of in-hospital mortality of intensive-care patients using free-text represented as embeddings and extracted data using the CHARMS checklist. Risk of bias was assessed using PROBAST. Reporting on the model was assessed with the TRIPOD guideline. To assess the machine learning components that were used in the models, we present a new descriptive framework based on different techniques to represent text and provide predictions from text. The study protocol was registered in the PROSPERO database (CRD42022354602).RESULTS: Eighteen studies out of 2,825 were included. All studies used the publicly-available MIMIC dataset. Context-independent word embeddings are widely used. Model discrimination was provided by all studies (AUROC 0.75-0.96), but measures of calibration were scarce. Seven studies used both structural clinical variables and notes. Model discrimination improved when adding clinical notes to variables. None of the models was externally validated and often a simple train/test split was used for internal validation. Our critical appraisal demonstrated a high risk of bias in all studies and concerns regarding their applicability in clinical practice.CONCLUSION: All studies used a neural architecture for prediction and were based on one publicly available dataset. Clinical notes were reported to improve predictive performance when used in addition to only clinical variables. Most studies had methodological, reporting, and applicability issues. We recommend reporting both model discrimination and calibration, using additional data sources, and using more robust evaluation strategies, including prospective and external validation. Finally, sharing data and code is encouraged to improve study reproducibility.

AB - OBJECTIVE: To review and critically appraise published and preprint reports of prognostic models of in-hospital mortality of patients in the intensive-care unit (ICU) based on neural representations (embeddings) of clinical notes.METHODS: PubMed and arXiv were searched up to August 1, 2022. At least two reviewers independently selected the studies that developed a prognostic model of in-hospital mortality of intensive-care patients using free-text represented as embeddings and extracted data using the CHARMS checklist. Risk of bias was assessed using PROBAST. Reporting on the model was assessed with the TRIPOD guideline. To assess the machine learning components that were used in the models, we present a new descriptive framework based on different techniques to represent text and provide predictions from text. The study protocol was registered in the PROSPERO database (CRD42022354602).RESULTS: Eighteen studies out of 2,825 were included. All studies used the publicly-available MIMIC dataset. Context-independent word embeddings are widely used. Model discrimination was provided by all studies (AUROC 0.75-0.96), but measures of calibration were scarce. Seven studies used both structural clinical variables and notes. Model discrimination improved when adding clinical notes to variables. None of the models was externally validated and often a simple train/test split was used for internal validation. Our critical appraisal demonstrated a high risk of bias in all studies and concerns regarding their applicability in clinical practice.CONCLUSION: All studies used a neural architecture for prediction and were based on one publicly available dataset. Clinical notes were reported to improve predictive performance when used in addition to only clinical variables. Most studies had methodological, reporting, and applicability issues. We recommend reporting both model discrimination and calibration, using additional data sources, and using more robust evaluation strategies, including prospective and external validation. Finally, sharing data and code is encouraged to improve study reproducibility.

KW - Intensive care

KW - Machine learning

KW - Mortality

KW - Natural language processing

KW - Prognostic models

KW - Systematic review

UR - http://www.scopus.com/inward/record.url?scp=85172890680&partnerID=8YFLogxK

U2 - https://doi.org/10.1016/j.jbi.2023.104504

DO - https://doi.org/10.1016/j.jbi.2023.104504

M3 - Review article

C2 - 37742782

SN - 1532-0464

VL - 146

SP - 104504

JO - Journal of biomedical informatics

JF - Journal of biomedical informatics

M1 - 104504

ER -

Prognostic models of in-hospital mortality of intensive care patients using neural representation of unstructured text: a systematic review and critical appraisal

Abstract

Keywords

Access to Document

Other files and links

Cite this