Local and Distributed Machine Learning for Inter-hospital Data Utilization: An Application for TAVI Outcome Prediction

Ricardo R. Lopes; Marco Mamprin; Jo M. Zelis; Pim A. L. Tonino; Martijn S. van Mourik; Marije M. Vis; Svitlana Zinger; Bas A. J. M. de Mol; Peter H. N. de With; Henk A. Marquering

doi:https://doi.org/10.3389/fcvm.2021.787246

Local and Distributed Machine Learning for Inter-hospital Data Utilization: An Application for TAVI Outcome Prediction

Ricardo R. Lopes, Marco Mamprin, Jo M. Zelis, Pim A. L. Tonino, Martijn S. van Mourik, Marije M. Vis, Svitlana Zinger, Bas A. J. M. de Mol, Peter H. N. de With, Henk A. Marquering

Research output: Contribution to journal › Article › Academic › peer-review

2 Citations (Scopus)

Abstract

Background: Machine learning models have been developed for numerous medical prognostic purposes. These models are commonly developed using data from single centers or regional registries. Including data from multiple centers improves robustness and accuracy of prognostic models. However, data sharing between multiple centers is complex, mainly because of regulations and patient privacy issues. Objective: We aim to overcome data sharing impediments by using distributed ML and local learning followed by model integration. We applied these techniques to develop 1-year TAVI mortality estimation models with data from two centers without sharing any data. Methods: A distributed ML technique and local learning followed by model integration was used to develop models to predict 1-year mortality after TAVI. We included two populations with 1,160 (Center A) and 631 (Center B) patients. Five traditional ML algorithms were implemented. The results were compared to models created individually on each center. Results: The combined learning techniques outperformed the mono-center models. For center A, the combined local XGBoost achieved an AUC of 0.67 (compared to a mono-center AUC of 0.65) and, for center B, a distributed neural network achieved an AUC of 0.68 (compared to a mono-center AUC of 0.64). Conclusion: This study shows that distributed ML and combined local models techniques, can overcome data sharing limitations and result in more accurate models for TAVI mortality estimation. We have shown improved prognostic accuracy for both centers and can also be used as an alternative to overcome the problem of limited amounts of data when creating prognostic models.

Original language	English
Article number	787246
Pages (from-to)	787246
Journal	Frontiers in cardiovascular medicine
Volume	8
DOIs	https://doi.org/10.3389/fcvm.2021.787246
Publication status	Published - 2021

Access to Document

https://doi.org/10.3389/fcvm.2021.787246

Cite this

Lopes, R. R., Mamprin, M., Zelis, J. M., Tonino, P. A. L., van Mourik, M. S., Vis, M. M., Zinger, S., de Mol, B. A. J. M., de With, P. H. N., & Marquering, H. A. (2021). Local and Distributed Machine Learning for Inter-hospital Data Utilization: An Application for TAVI Outcome Prediction. Frontiers in cardiovascular medicine, 8, 787246. Article 787246. https://doi.org/10.3389/fcvm.2021.787246

@article{d2bd48a461234876a54658e179eb8d8e,

title = "Local and Distributed Machine Learning for Inter-hospital Data Utilization: An Application for TAVI Outcome Prediction",

abstract = "Background: Machine learning models have been developed for numerous medical prognostic purposes. These models are commonly developed using data from single centers or regional registries. Including data from multiple centers improves robustness and accuracy of prognostic models. However, data sharing between multiple centers is complex, mainly because of regulations and patient privacy issues. Objective: We aim to overcome data sharing impediments by using distributed ML and local learning followed by model integration. We applied these techniques to develop 1-year TAVI mortality estimation models with data from two centers without sharing any data. Methods: A distributed ML technique and local learning followed by model integration was used to develop models to predict 1-year mortality after TAVI. We included two populations with 1,160 (Center A) and 631 (Center B) patients. Five traditional ML algorithms were implemented. The results were compared to models created individually on each center. Results: The combined learning techniques outperformed the mono-center models. For center A, the combined local XGBoost achieved an AUC of 0.67 (compared to a mono-center AUC of 0.65) and, for center B, a distributed neural network achieved an AUC of 0.68 (compared to a mono-center AUC of 0.64). Conclusion: This study shows that distributed ML and combined local models techniques, can overcome data sharing limitations and result in more accurate models for TAVI mortality estimation. We have shown improved prognostic accuracy for both centers and can also be used as an alternative to overcome the problem of limited amounts of data when creating prognostic models.",

author = "Lopes, {Ricardo R.} and Marco Mamprin and Zelis, {Jo M.} and Tonino, {Pim A. L.} and {van Mourik}, {Martijn S.} and Vis, {Marije M.} and Svitlana Zinger and {de Mol}, {Bas A. J. M.} and {de With}, {Peter H. N.} and Marquering, {Henk A.}",

note = "Copyright {\textcopyright} 2021 Lopes, Mamprin, Zelis, Tonino, van Mourik, Vis, Zinger, de Mol, de With and Marquering.",

year = "2021",

doi = "https://doi.org/10.3389/fcvm.2021.787246",

language = "English",

volume = "8",

pages = "787246",

journal = "Frontiers in cardiovascular medicine",

issn = "2297-055X",

publisher = "Frontiers Media S.A.",

}

TY - JOUR

T1 - Local and Distributed Machine Learning for Inter-hospital Data Utilization

T2 - An Application for TAVI Outcome Prediction

AU - Lopes, Ricardo R.

AU - Mamprin, Marco

AU - Zelis, Jo M.

AU - Tonino, Pim A. L.

AU - van Mourik, Martijn S.

AU - Vis, Marije M.

AU - Zinger, Svitlana

AU - de Mol, Bas A. J. M.

AU - de With, Peter H. N.

AU - Marquering, Henk A.

PY - 2021

Y1 - 2021

N2 - Background: Machine learning models have been developed for numerous medical prognostic purposes. These models are commonly developed using data from single centers or regional registries. Including data from multiple centers improves robustness and accuracy of prognostic models. However, data sharing between multiple centers is complex, mainly because of regulations and patient privacy issues. Objective: We aim to overcome data sharing impediments by using distributed ML and local learning followed by model integration. We applied these techniques to develop 1-year TAVI mortality estimation models with data from two centers without sharing any data. Methods: A distributed ML technique and local learning followed by model integration was used to develop models to predict 1-year mortality after TAVI. We included two populations with 1,160 (Center A) and 631 (Center B) patients. Five traditional ML algorithms were implemented. The results were compared to models created individually on each center. Results: The combined learning techniques outperformed the mono-center models. For center A, the combined local XGBoost achieved an AUC of 0.67 (compared to a mono-center AUC of 0.65) and, for center B, a distributed neural network achieved an AUC of 0.68 (compared to a mono-center AUC of 0.64). Conclusion: This study shows that distributed ML and combined local models techniques, can overcome data sharing limitations and result in more accurate models for TAVI mortality estimation. We have shown improved prognostic accuracy for both centers and can also be used as an alternative to overcome the problem of limited amounts of data when creating prognostic models.

AB - Background: Machine learning models have been developed for numerous medical prognostic purposes. These models are commonly developed using data from single centers or regional registries. Including data from multiple centers improves robustness and accuracy of prognostic models. However, data sharing between multiple centers is complex, mainly because of regulations and patient privacy issues. Objective: We aim to overcome data sharing impediments by using distributed ML and local learning followed by model integration. We applied these techniques to develop 1-year TAVI mortality estimation models with data from two centers without sharing any data. Methods: A distributed ML technique and local learning followed by model integration was used to develop models to predict 1-year mortality after TAVI. We included two populations with 1,160 (Center A) and 631 (Center B) patients. Five traditional ML algorithms were implemented. The results were compared to models created individually on each center. Results: The combined learning techniques outperformed the mono-center models. For center A, the combined local XGBoost achieved an AUC of 0.67 (compared to a mono-center AUC of 0.65) and, for center B, a distributed neural network achieved an AUC of 0.68 (compared to a mono-center AUC of 0.64). Conclusion: This study shows that distributed ML and combined local models techniques, can overcome data sharing limitations and result in more accurate models for TAVI mortality estimation. We have shown improved prognostic accuracy for both centers and can also be used as an alternative to overcome the problem of limited amounts of data when creating prognostic models.

UR - https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85159625685&origin=inward

U2 - https://doi.org/10.3389/fcvm.2021.787246

DO - https://doi.org/10.3389/fcvm.2021.787246

M3 - Article

C2 - 34869698

SN - 2297-055X

VL - 8

SP - 787246

JO - Frontiers in cardiovascular medicine

JF - Frontiers in cardiovascular medicine

M1 - 787246

ER -

Local and Distributed Machine Learning for Inter-hospital Data Utilization: An Application for TAVI Outcome Prediction

Abstract

Access to Document

Other files and links

Cite this