Privacy-Preserving Federated Model Predicting Bipolar Transition in Patients With Depression: Prediction Model Development Study

Dong Yun Lee; Byungjin Choi; Chungsoo Kim; Egill Fridgeirsson; Jenna Reps; Myoungsuk Kim; Jihyeong Kim; Jae-Won Jang; Sang Youl Rhee; Won-Woo Seo; Seunghoon Lee; Sang Joon Son; Rae Woong Park

doi:https://doi.org/10.2196/46165

Research output: Contribution to journal › Article › Academic › peer-review

2 Citations (Scopus)

Abstract

Background: Mood disorder has emerged as a serious concern for public health; in particular, bipolar disorder has a less favorable prognosis than depression. Although prompt recognition of depression conversion to bipolar disorder is needed, early prediction is challenging due to overlapping symptoms. Recently, there have been attempts to develop a prediction model by using federated learning. Federated learning in medical fields is a method for training multi-institutional machine learning models without patient-level data sharing.

Objective: This study aims to develop and validate a federated, differentially private multi-institutional bipolar transition prediction model.

Methods: This retrospective study enrolled patients diagnosed with the first depressive episode at 5 tertiary hospitals in South Korea. We developed models for predicting bipolar transition by using data from 17,631 patients in 4 institutions. Further, we used data from 4541 patients for external validation from 1 institution. We created standardized pipelines to extract large-scale clinical features from the 4 institutions without any code modification. Moreover, we performed feature selection in a federated environment for computational efficiency and applied differential privacy to gradient updates. Finally, we compared the federated and the 4 local models developed with each hospital's data on internal and external validation data sets.

Results: In the internal data set, 279 out of 17,631 patients showed bipolar disorder transition. In the external data set, 39 out of 4541 patients showed bipolar disorder transition. The average performance of the federated model in the internal test (area under the curve [AUC] 0.726) and external validation (AUC 0.719) data sets was higher than that of the other locally developed models (AUC 0.642-0.707 and AUC 0.642-0.699, respectively). In the federated model, classifications were driven by several predictors such as the Charlson index (low scores were associated with bipolar transition, which may be due to younger age), severe depression, anxiolytics, young age, and visiting months (the bipolar transition was associated with seasonality, especially during the spring and summer months).

Conclusions: We developed and validated a differentially private federated model by using distributed multi-institutional psychiatric data with standardized pipelines in a real-world environment. The federated model performed better than models using local data only.
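
The abstract states that differential privacy was applied to gradient updates in a federated setting but does not give implementation details. The following is a minimal sketch of that general idea, not the authors' method: it assumes a logistic regression model, clips each site's averaged gradient (a simplification; formal record-level guarantees would typically require per-example clipping and privacy accounting), and adds Gaussian noise before the server averages the updates. All function names and parameters (`max_norm`, `noise_std`, `lr`) are hypothetical.

```python
import math
import random


def sigmoid(z):
    """Numerically stable logistic function."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def clip(vec, max_norm):
    """Scale vec down so its L2 norm is at most max_norm."""
    norm = math.sqrt(sum(v * v for v in vec))
    scale = min(1.0, max_norm / norm) if norm > 0 else 1.0
    return [v * scale for v in vec]


def local_gradient(weights, rows, labels):
    """Mean logistic-loss gradient over one site's records."""
    grad = [0.0] * len(weights)
    for x, y in zip(rows, labels):
        p = sigmoid(sum(w * xi for w, xi in zip(weights, x)))
        for j, xi in enumerate(x):
            grad[j] += (p - y) * xi
    return [g / len(rows) for g in grad]


def private_update(weights, rows, labels, max_norm, noise_std, rng):
    """Clip the site's gradient, then add Gaussian noise before sharing it."""
    g = clip(local_gradient(weights, rows, labels), max_norm)
    return [gi + rng.gauss(0.0, noise_std) for gi in g]


def federated_round(weights, sites, lr, max_norm, noise_std, rng):
    """One round: every site sends a noised gradient; the server averages them."""
    updates = [
        private_update(weights, rows, labels, max_norm, noise_std, rng)
        for rows, labels in sites
    ]
    mean = [sum(col) / len(updates) for col in zip(*updates)]
    return [w - lr * m for w, m in zip(weights, mean)]
```

In this sketch only the clipped, noised gradients leave a site; the patient-level rows and labels stay local, which is the property the abstract attributes to federated learning. Calling `federated_round` repeatedly with a list of `(rows, labels)` pairs, one per hospital, would correspond to successive training rounds.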

Original language	English
Article number	e46165
Journal	Journal of Medical Internet Research
Volume	25
DOIs	https://doi.org/10.2196/46165
Publication status	Published - 2023

Keywords

bipolar disorder
data standardization
depression
differential privacy
federated learning

Access to Document

https://doi.org/10.2196/46165

Cite this

Lee, D. Y., Choi, B., Kim, C., Fridgeirsson, E., Reps, J., Kim, M., Kim, J., Jang, J.-W., Rhee, S. Y., Seo, W.-W., Lee, S., Son, S. J., & Park, R. W. (2023). Privacy-Preserving Federated Model Predicting Bipolar Transition in Patients With Depression: Prediction Model Development Study. Journal of Medical Internet Research, 25, Article e46165. https://doi.org/10.2196/46165

@article{29be2c392acf4f9985c63dce06368fc9,

title = "Privacy-Preserving Federated Model Predicting Bipolar Transition in Patients With Depression: Prediction Model Development Study",

abstract = "Background: Mood disorder has emerged as a serious concern for public health; in particular, bipolar disorder has a less favorable prognosis than depression. Although prompt recognition of depression conversion to bipolar disorder is needed, early prediction is challenging due to overlapping symptoms. Recently, there have been attempts to develop a prediction model by using federated learning. Federated learning in medical fields is a method for training multi-institutional machine learning models without patient-level data sharing. Objective: This study aims to develop and validate a federated, differentially private multi-institutional bipolar transition prediction model. Methods: This retrospective study enrolled patients diagnosed with the first depressive episode at 5 tertiary hospitals in South Korea. We developed models for predicting bipolar transition by using data from 17,631 patients in 4 institutions. Further, we used data from 4541 patients for external validation from 1 institution. We created standardized pipelines to extract large-scale clinical features from the 4 institutions without any code modification. Moreover, we performed feature selection in a federated environment for computational efficiency and applied differential privacy to gradient updates. Finally, we compared the federated and the 4 local models developed with each hospital's data on internal and external validation data sets. Results: In the internal data set, 279 out of 17,631 patients showed bipolar disorder transition. In the external data set, 39 out of 4541 patients showed bipolar disorder transition. The average performance of the federated model in the internal test (area under the curve [AUC] 0.726) and external validation (AUC 0.719) data sets was higher than that of the other locally developed models (AUC 0.642-0.707 and AUC 0.642-0.699, respectively). 
In the federated model, classifications were driven by several predictors such as the Charlson index (low scores were associated with bipolar transition, which may be due to younger age), severe depression, anxiolytics, young age, and visiting months (the bipolar transition was associated with seasonality, especially during the spring and summer months). Conclusions: We developed and validated a differentially private federated model by using distributed multi-institutional psychiatric data with standardized pipelines in a real-world environment. The federated model performed better than models using local data only.",

keywords = "bipolar disorder, data standardization, depression, differential privacy, federated learning",

author = "Lee, {Dong Yun} and Byungjin Choi and Chungsoo Kim and Egill Fridgeirsson and Jenna Reps and Myoungsuk Kim and Jihyeong Kim and Jae-Won Jang and Rhee, {Sang Youl} and Won-Woo Seo and Seunghoon Lee and Son, {Sang Joon} and Park, {Rae Woong}",

note = "Funding Information: This research was funded by a grant from the Korea Health Technology Research and Development Project through the Korea Health Industry Development Institute, funded by the Ministry of Health and Welfare, Republic of Korea (grant HR16C0001). Publisher Copyright: {\textcopyright} 2023 Journal of Medical Internet Research. All rights reserved.",

year = "2023",

doi = "10.2196/46165",

language = "English",

volume = "25",

journal = "Journal of Medical Internet Research",

issn = "2291-5222",

publisher = "Journal of Medical Internet Research",

}

TY - JOUR

T1 - Privacy-Preserving Federated Model Predicting Bipolar Transition in Patients With Depression

T2 - Prediction Model Development Study

AU - Lee, Dong Yun

AU - Choi, Byungjin

AU - Kim, Chungsoo

AU - Fridgeirsson, Egill

AU - Reps, Jenna

AU - Kim, Myoungsuk

AU - Kim, Jihyeong

AU - Jang, Jae-Won

AU - Rhee, Sang Youl

AU - Seo, Won-Woo

AU - Lee, Seunghoon

AU - Son, Sang Joon

AU - Park, Rae Woong

N1 - Funding Information: This research was funded by a grant from the Korea Health Technology Research and Development Project through the Korea Health Industry Development Institute, funded by the Ministry of Health and Welfare, Republic of Korea (grant HR16C0001). Publisher Copyright: © 2023 Journal of Medical Internet Research. All rights reserved.

PY - 2023

Y1 - 2023

AB - Background: Mood disorder has emerged as a serious concern for public health; in particular, bipolar disorder has a less favorable prognosis than depression. Although prompt recognition of depression conversion to bipolar disorder is needed, early prediction is challenging due to overlapping symptoms. Recently, there have been attempts to develop a prediction model by using federated learning. Federated learning in medical fields is a method for training multi-institutional machine learning models without patient-level data sharing. Objective: This study aims to develop and validate a federated, differentially private multi-institutional bipolar transition prediction model. Methods: This retrospective study enrolled patients diagnosed with the first depressive episode at 5 tertiary hospitals in South Korea. We developed models for predicting bipolar transition by using data from 17,631 patients in 4 institutions. Further, we used data from 4541 patients for external validation from 1 institution. We created standardized pipelines to extract large-scale clinical features from the 4 institutions without any code modification. Moreover, we performed feature selection in a federated environment for computational efficiency and applied differential privacy to gradient updates. Finally, we compared the federated and the 4 local models developed with each hospital's data on internal and external validation data sets. Results: In the internal data set, 279 out of 17,631 patients showed bipolar disorder transition. In the external data set, 39 out of 4541 patients showed bipolar disorder transition. The average performance of the federated model in the internal test (area under the curve [AUC] 0.726) and external validation (AUC 0.719) data sets was higher than that of the other locally developed models (AUC 0.642-0.707 and AUC 0.642-0.699, respectively). 
In the federated model, classifications were driven by several predictors such as the Charlson index (low scores were associated with bipolar transition, which may be due to younger age), severe depression, anxiolytics, young age, and visiting months (the bipolar transition was associated with seasonality, especially during the spring and summer months). Conclusions: We developed and validated a differentially private federated model by using distributed multi-institutional psychiatric data with standardized pipelines in a real-world environment. The federated model performed better than models using local data only.

KW - bipolar disorder

KW - data standardization

KW - depression

KW - differential privacy

KW - federated learning

UR - http://www.scopus.com/inward/record.url?scp=85165347899&partnerID=8YFLogxK

U2 - https://doi.org/10.2196/46165

DO - https://doi.org/10.2196/46165

M3 - Article

C2 - 37471130

SN - 2291-5222

VL - 25

JO - Journal of Medical Internet Research

JF - Journal of Medical Internet Research

M1 - e46165

ER -
