TY - GEN
T1 - Trust Issues: Uncertainty Estimation Does Not Enable Reliable OOD Detection On Medical Tabular Data
AU - Ulmer, Dennis
AU - Meijerink, Lotta
AU - Cinà, Giovanni
N1 - Publisher Copyright: © 2020 D. Ulmer, L. Meijerink & G. Cinà.
PY - 2020
Y1 - 2020
N2 - When deploying machine learning models in high-stakes real-world environments such as health care, it is crucial to accurately assess the uncertainty concerning a model’s prediction on abnormal inputs. However, there is a scarcity of literature analyzing this problem on medical data, especially on mixed-type tabular data such as Electronic Health Records. We close this gap by presenting a series of tests including a large variety of contemporary uncertainty estimation techniques, in order to determine whether they are able to identify out-of-distribution (OOD) patients. In contrast to previous work, we design tests on realistic and clinically relevant OOD groups, and run experiments on real-world medical data. We find that almost all techniques fail to achieve convincing results, partly disagreeing with earlier findings.
AB - When deploying machine learning models in high-stakes real-world environments such as health care, it is crucial to accurately assess the uncertainty concerning a model’s prediction on abnormal inputs. However, there is a scarcity of literature analyzing this problem on medical data, especially on mixed-type tabular data such as Electronic Health Records. We close this gap by presenting a series of tests including a large variety of contemporary uncertainty estimation techniques, in order to determine whether they are able to identify out-of-distribution (OOD) patients. In contrast to previous work, we design tests on realistic and clinically relevant OOD groups, and run experiments on real-world medical data. We find that almost all techniques fail to achieve convincing results, partly disagreeing with earlier findings.
KW - Electronic Health Records
KW - OOD Detection
KW - Uncertainty Estimation
UR - http://www.scopus.com/inward/record.url?scp=85104611252&partnerID=8YFLogxK
M3 - Conference contribution
VL - 136
T3 - Proceedings of Machine Learning Research
SP - 341
EP - 354
BT - Proceedings of Machine Learning Research
T2 - 6th Workshop on Machine Learning for Health: Advancing Healthcare for All, ML4H 2020, in conjunction with the 34th Conference on Neural Information Processing Systems, NeurIPS 2020
Y2 - 11 December 2020
ER -