Mortality Prediction Models with Clinical Notes Using Sparse Attention at the Word and Sentence Levels

Miguel Rios, Ameen Abu-Hanna

Research output: Working paper / Preprint

Abstract

Intensive Care in-hospital mortality prediction has various clinical applications. Neural prediction models, especially when capitalising on clinical notes, have been put forward as an improvement over existing models. However, to be acceptable these models should be both performant and transparent. This work studies different attention mechanisms for clinical neural prediction models in terms of their discrimination and calibration. Specifically, we investigate sparse attention as an alternative to dense attention weights in the task of in-hospital mortality prediction from clinical notes. We evaluate the attention mechanisms based on: i) local self-attention over words in a sentence, and ii) global self-attention with a transformer architecture across sentences. We demonstrate that the sparse mechanism outperforms the dense one for local self-attention in terms of predictive performance on a publicly available dataset, and assigns higher attention to prespecified relevant directive words. The performance at the sentence level, however, deteriorates, as sentences containing the influential directive words tend to be dropped altogether.
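The contrast between dense and sparse attention weights can be illustrated with the sparsemax transformation (Martins & Astudillo, 2016), a common sparse alternative to softmax. The sketch below is illustrative only and is not taken from the paper; the abstract does not specify which sparse mapping the authors use, and the input scores are made-up values.

```python
import numpy as np

def softmax(z):
    """Dense attention: every score receives a strictly positive weight."""
    e = np.exp(z - z.max())
    return e / e.sum()

def sparsemax(z):
    """Sparse attention via Euclidean projection onto the probability simplex
    (Martins & Astudillo, 2016). Low-scoring entries get exactly zero weight."""
    z_sorted = np.sort(z)[::-1]                 # scores in descending order
    k = np.arange(1, len(z) + 1)
    cssv = np.cumsum(z_sorted)
    support = 1 + k * z_sorted > cssv           # indices kept in the support
    k_z = k[support][-1]                        # size of the support
    tau = (cssv[k_z - 1] - 1) / k_z             # threshold
    return np.maximum(z - tau, 0.0)

# Hypothetical attention scores for four words in a sentence
scores = np.array([2.0, 1.0, 0.1, -1.0])
print(softmax(scores))    # all four weights strictly positive
print(sparsemax(scores))  # trailing words receive exactly zero weight
```

With these scores, softmax spreads some mass over every word, while sparsemax concentrates all mass on the top-scoring word. This zeroing-out behaviour mirrors the effect described in the abstract: sparsity can sharpen the focus on relevant directive words, but at the sentence level it can also drop entire informative sentences.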
Original language: Undefined/Unknown
Publication status: Published - 12 Dec 2022

Keywords

  • cs.CL
  • cs.LG
