Attention-based neural networks for clinical prediction modelling on electronic health records

Egill A. Fridgeirsson; David Sontag; Peter Rijnbeek

doi:https://doi.org/10.1186/s12874-023-02112-2

Attention-based neural networks for clinical prediction modelling on electronic health records

Egill A. Fridgeirsson, David Sontag, Peter Rijnbeek

Research output: Contribution to journal › Article › Academic › peer-review

Abstract

Background: Deep learning models have had a lot of success in various fields. However, on structured data they have struggled. Here we apply four state-of-the-art supervised deep learning models using the attention mechanism and compare against logistic regression and XGBoost using discrimination, calibration and clinical utility. Methods: We develop the models using a general practitioners database. We implement a recurrent neural network, a transformer with and without reverse distillation and a graph neural network. We measure discrimination using the area under the receiver operating characteristic curve (AUC) and the area under the precision recall curve (AUPRC). We assess smooth calibration using restricted cubic splines and clinical utility with decision curve analysis. Results: Our results show that deep learning approaches can improve discrimination up to 2.5% points AUC and 7.4% points AUPRC. However, on average the baselines are competitive. Most models are similarly calibrated as the baselines except for the graph neural network. The transformer using reverse distillation shows the best performance in clinical utility on two out of three prediction problems over most of the prediction thresholds. Conclusion: In this study, we evaluated various approaches in supervised learning using neural networks and attention. Here we do a rigorous comparison, not only looking at discrimination but also calibration and clinical utility. There is value in using deep learning models on electronic health record data since it can improve discrimination and clinical utility while providing good calibration. However, good baseline methods are still competitive.

Original language	English
Article number	285
Journal	BMC medical research methodology
Volume	23
Issue number	1
DOIs	https://doi.org/10.1186/s12874-023-02112-2
Publication status	Published - 1 Dec 2023

Keywords

Clinical prediction models
Deep learning
Electronic health records

Access to Document

https://doi.org/10.1186/s12874-023-02112-2

Cite this

@article{353ed12baa7b4299a750c40ec6b94b8e,

title = "Attention-based neural networks for clinical prediction modelling on electronic health records",

abstract = "Background: Deep learning models have had a lot of success in various fields. However, on structured data they have struggled. Here we apply four state-of-the-art supervised deep learning models using the attention mechanism and compare against logistic regression and XGBoost using discrimination, calibration and clinical utility. Methods: We develop the models using a general practitioners database. We implement a recurrent neural network, a transformer with and without reverse distillation and a graph neural network. We measure discrimination using the area under the receiver operating characteristic curve (AUC) and the area under the precision recall curve (AUPRC). We assess smooth calibration using restricted cubic splines and clinical utility with decision curve analysis. Results: Our results show that deep learning approaches can improve discrimination up to 2.5% points AUC and 7.4% points AUPRC. However, on average the baselines are competitive. Most models are similarly calibrated as the baselines except for the graph neural network. The transformer using reverse distillation shows the best performance in clinical utility on two out of three prediction problems over most of the prediction thresholds. Conclusion: In this study, we evaluated various approaches in supervised learning using neural networks and attention. Here we do a rigorous comparison, not only looking at discrimination but also calibration and clinical utility. There is value in using deep learning models on electronic health record data since it can improve discrimination and clinical utility while providing good calibration. However, good baseline methods are still competitive.",

keywords = "Clinical prediction models, Deep learning, Electronic health records",

author = "Fridgeirsson, {Egill A.} and David Sontag and Peter Rijnbeek",

note = "Funding Information: This project has received funding from the Innovative Medicines Initiative 2 Joint Undertaking (JU) under grant agreement No 806968. The JU receives support from the European Union{\textquoteright}s Horizon 2020 research and innovation programme and EFPIA. Funding Information: DS was a consultant for and has equity in CureAI and ASAPP and has received compensation from speaking at Genentech. DS has a grant from Takeda. EF and PR work for a research group who received unconditional research grants from Boehringer-Ingelheim, GSK, Janssen Research & Development, Novartis, Pfizer, Yamanouchi, Servier. None of these grants result in a conflict of interest to the content of this paper. Publisher Copyright: {\textcopyright} 2023, The Author(s).",

year = "2023",

month = dec,

day = "1",

doi = "https://doi.org/10.1186/s12874-023-02112-2",

language = "English",

volume = "23",

journal = "BMC medical research methodology",

issn = "1471-2288",

publisher = "BioMed Central",

number = "1",

}

TY - JOUR

T1 - Attention-based neural networks for clinical prediction modelling on electronic health records

AU - Fridgeirsson, Egill A.

AU - Sontag, David

AU - Rijnbeek, Peter

N1 - Funding Information: This project has received funding from the Innovative Medicines Initiative 2 Joint Undertaking (JU) under grant agreement No 806968. The JU receives support from the European Union’s Horizon 2020 research and innovation programme and EFPIA. Funding Information: DS was a consultant for and has equity in CureAI and ASAPP and has received compensation from speaking at Genentech. DS has a grant from Takeda. EF and PR work for a research group who received unconditional research grants from Boehringer-Ingelheim, GSK, Janssen Research & Development, Novartis, Pfizer, Yamanouchi, Servier. None of these grants result in a conflict of interest to the content of this paper. Publisher Copyright: © 2023, The Author(s).

PY - 2023/12/1

Y1 - 2023/12/1

N2 - Background: Deep learning models have had a lot of success in various fields. However, on structured data they have struggled. Here we apply four state-of-the-art supervised deep learning models using the attention mechanism and compare against logistic regression and XGBoost using discrimination, calibration and clinical utility. Methods: We develop the models using a general practitioners database. We implement a recurrent neural network, a transformer with and without reverse distillation and a graph neural network. We measure discrimination using the area under the receiver operating characteristic curve (AUC) and the area under the precision recall curve (AUPRC). We assess smooth calibration using restricted cubic splines and clinical utility with decision curve analysis. Results: Our results show that deep learning approaches can improve discrimination up to 2.5% points AUC and 7.4% points AUPRC. However, on average the baselines are competitive. Most models are similarly calibrated as the baselines except for the graph neural network. The transformer using reverse distillation shows the best performance in clinical utility on two out of three prediction problems over most of the prediction thresholds. Conclusion: In this study, we evaluated various approaches in supervised learning using neural networks and attention. Here we do a rigorous comparison, not only looking at discrimination but also calibration and clinical utility. There is value in using deep learning models on electronic health record data since it can improve discrimination and clinical utility while providing good calibration. However, good baseline methods are still competitive.

AB - Background: Deep learning models have had a lot of success in various fields. However, on structured data they have struggled. Here we apply four state-of-the-art supervised deep learning models using the attention mechanism and compare against logistic regression and XGBoost using discrimination, calibration and clinical utility. Methods: We develop the models using a general practitioners database. We implement a recurrent neural network, a transformer with and without reverse distillation and a graph neural network. We measure discrimination using the area under the receiver operating characteristic curve (AUC) and the area under the precision recall curve (AUPRC). We assess smooth calibration using restricted cubic splines and clinical utility with decision curve analysis. Results: Our results show that deep learning approaches can improve discrimination up to 2.5% points AUC and 7.4% points AUPRC. However, on average the baselines are competitive. Most models are similarly calibrated as the baselines except for the graph neural network. The transformer using reverse distillation shows the best performance in clinical utility on two out of three prediction problems over most of the prediction thresholds. Conclusion: In this study, we evaluated various approaches in supervised learning using neural networks and attention. Here we do a rigorous comparison, not only looking at discrimination but also calibration and clinical utility. There is value in using deep learning models on electronic health record data since it can improve discrimination and clinical utility while providing good calibration. However, good baseline methods are still competitive.

KW - Clinical prediction models

KW - Deep learning

KW - Electronic health records

UR - http://www.scopus.com/inward/record.url?scp=85178898493&partnerID=8YFLogxK

U2 - https://doi.org/10.1186/s12874-023-02112-2

DO - https://doi.org/10.1186/s12874-023-02112-2

M3 - Article

C2 - 38062352

SN - 1471-2288

VL - 23

JO - BMC medical research methodology

JF - BMC medical research methodology

IS - 1

M1 - 285

ER -

Attention-based neural networks for clinical prediction modelling on electronic health records

Abstract

Keywords

Access to Document

Other files and links

Cite this