Ontology highlight
ABSTRACT: Background
Machine learning models require large datasets that may be siloed across different health care institutions. Machine learning studies that focus on COVID-19 have been limited to single-hospital data, which limits model generalizability.Objective
We aimed to use federated learning, a machine learning technique that avoids locally aggregating raw clinical data across multiple institutions, to predict mortality in hospitalized patients with COVID-19 within 7 days.Methods
Patient data were collected from the electronic health records of 5 hospitals within the Mount Sinai Health System. Logistic regression with L1 regularization/least absolute shrinkage and selection operator (LASSO) and multilayer perceptron (MLP) models were trained by using local data at each site. We developed a pooled model with combined data from all 5 sites, and a federated model that only shared parameters with a central aggregator.Results
The LASSOfederated model outperformed the LASSOlocal model at 3 hospitals, and the MLPfederated model performed better than the MLPlocal model at all 5 hospitals, as determined by the area under the receiver operating characteristic curve. The LASSOpooled model outperformed the LASSOfederated model at all hospitals, and the MLPfederated model outperformed the MLPpooled model at 2 hospitals.Conclusions
The federated learning of COVID-19 electronic health record data shows promise in developing robust predictive models without compromising patient privacy.
SUBMITTER: Vaid A
PROVIDER: S-EPMC7842859 | biostudies-literature | 2021 Jan
REPOSITORIES: biostudies-literature
Vaid Akhil A Jaladanki Suraj K SK Xu Jie J Teng Shelly S Kumar Arvind A Lee Samuel S Somani Sulaiman S Paranjpe Ishan I De Freitas Jessica K JK Wanyan Tingyi T Johnson Kipp W KW Bicak Mesude M Klang Eyal E Kwon Young Joon YJ Costa Anthony A Zhao Shan S Miotto Riccardo R Charney Alexander W AW Böttinger Erwin E Fayad Zahi A ZA Nadkarni Girish N GN Wang Fei F Glicksberg Benjamin S BS
JMIR medical informatics 20210127 1
<h4>Background</h4>Machine learning models require large datasets that may be siloed across different health care institutions. Machine learning studies that focus on COVID-19 have been limited to single-hospital data, which limits model generalizability.<h4>Objective</h4>We aimed to use federated learning, a machine learning technique that avoids locally aggregating raw clinical data across multiple institutions, to predict mortality in hospitalized patients with COVID-19 within 7 days.<h4>Meth ...[more]