TY - GEN
T1 - Autoencoder-based prediction of ICU clinical codes
AU - Yordanov, T.R.
AU - Abu-Hanna, Ameen
AU - Ravelli, A.C.J.
AU - Vagliano, I.
N1 - Publisher Copyright: © 2023, The Author(s), under exclusive license to Springer Nature Switzerland AG.
PY - 2023/6/5
Y1 - 2023/6/5
N2 - Availability of diagnostic codes in Electronic Health Records (EHRs) is crucial for patient care as well as reimbursement purposes. However, entering them in the EHR is tedious, and some clinical codes may be overlooked. Given an incomplete list of clinical codes, we investigate the performance of ML methods on predicting the complete ones, and assess the added predictive value of including other clinical patient data in this task. We used the MIMIC-III dataset and frame the task of completing the clinical codes as a recommendation problem. We consider various autoencoder approaches plus two strong baselines; item co-occurrence and Singular Value Decomposition (SVD). Inputs are 1) a record’s known clinical codes, 2) the codes plus variables. The co-occurrence-based approach performed slightly better (F1 score = 0.26, Mean Average Precision [MAP] = 0.19) than the SVD (F1 = 0.24, MAP = 0.18). However, the adversarial autoencoder achieved the best performance when using the codes plus variables (F1 = 0.32, MAP = 0.25). Adversarial autoencoders performed best in terms of F1 and were equal to vanilla and denoising autoencoders in term of MAP. Using clinical variables in addition to the incomplete codes list, improves the predictive performance of the models.
AB - Availability of diagnostic codes in Electronic Health Records (EHRs) is crucial for patient care as well as reimbursement purposes. However, entering them in the EHR is tedious, and some clinical codes may be overlooked. Given an incomplete list of clinical codes, we investigate the performance of ML methods on predicting the complete ones, and assess the added predictive value of including other clinical patient data in this task. We used the MIMIC-III dataset and frame the task of completing the clinical codes as a recommendation problem. We consider various autoencoder approaches plus two strong baselines; item co-occurrence and Singular Value Decomposition (SVD). Inputs are 1) a record’s known clinical codes, 2) the codes plus variables. The co-occurrence-based approach performed slightly better (F1 score = 0.26, Mean Average Precision [MAP] = 0.19) than the SVD (F1 = 0.24, MAP = 0.18). However, the adversarial autoencoder achieved the best performance when using the codes plus variables (F1 = 0.32, MAP = 0.25). Adversarial autoencoders performed best in terms of F1 and were equal to vanilla and denoising autoencoders in term of MAP. Using clinical variables in addition to the incomplete codes list, improves the predictive performance of the models.
KW - Autoencoders
KW - Medical codes
KW - Prediction
KW - Recommender Systems
UR - http://www.scopus.com/inward/record.url?scp=85164010177&partnerID=8YFLogxK
U2 - https://doi.org/10.1007/978-3-031-34344-5_8
DO - https://doi.org/10.1007/978-3-031-34344-5_8
M3 - Conference contribution
SN - 9783031343438
VL - 13897
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 57
EP - 62
BT - Artificial Intelligence in Medicine - 21st International Conference on Artificial Intelligence in Medicine, AIME 2023, Proceedings
A2 - Juarez, Jose M.
A2 - Marcos, Mar
A2 - Stiglic, Gregor
A2 - Tucker, Allan
PB - Springer
T2 - 21st International Conference on Artificial Intelligence in Medicine, AIME 2023
Y2 - 12 June 2023 through 15 June 2023
ER -