Unknown

Dataset Information

0

Using Multi-Modal Electronic Health Record Data for the Development and Validation of Risk Prediction Models for Long COVID Using the Super Learner Algorithm.


ABSTRACT:

Background

Post-Acute Sequelae of COVID-19 (PASC) have emerged as a global public health and healthcare challenge. This study aimed to uncover predictive factors for PASC from multi-modal data to develop a predictive model for PASC diagnoses.

Methods

We analyzed electronic health records from 92,301 COVID-19 patients, covering medical phenotypes, medications, and lab results. We used a Super Learner-based prediction approach to identify predictive factors. We integrated the model outputs into individual and composite risk scores and evaluated their predictive performance.

Results

Our analysis identified several factors predictive of diagnoses of PASC, including being overweight/obese and the use of HMG CoA reductase inhibitors prior to COVID-19 infection, and respiratory system symptoms during COVID-19 infection. We developed a composite risk score with a moderate discriminatory ability for PASC (covariate-adjusted AUC (95% confidence interval): 0.66 (0.63, 0.69)) by combining the risk scores based on phenotype and medication records. The combined risk score could identify 10% of individuals with a 2.2-fold increased risk for PASC.

Conclusions

We identified several factors predictive of diagnoses of PASC and integrated the information into a composite risk score for PASC prediction, which could contribute to the identification of individuals at higher risk for PASC and inform preventive efforts.

SUBMITTER: Jin W 

PROVIDER: S-EPMC10707399 | biostudies-literature | 2023 Nov

REPOSITORIES: biostudies-literature

altmetric image

Publications

Using Multi-Modal Electronic Health Record Data for the Development and Validation of Risk Prediction Models for Long COVID Using the Super Learner Algorithm.

Jin Weijia W   Hao Wei W   Shi Xu X   Fritsche Lars G LG   Salvatore Maxwell M   Admon Andrew J AJ   Friese Christopher R CR   Mukherjee Bhramar B  

Journal of clinical medicine 20231125 23


<h4>Background</h4>Post-Acute Sequelae of COVID-19 (PASC) have emerged as a global public health and healthcare challenge. This study aimed to uncover predictive factors for PASC from multi-modal data to develop a predictive model for PASC diagnoses.<h4>Methods</h4>We analyzed electronic health records from 92,301 COVID-19 patients, covering medical phenotypes, medications, and lab results. We used a Super Learner-based prediction approach to identify predictive factors. We integrated the model  ...[more]

Similar Datasets

| S-EPMC10160242 | biostudies-literature
| S-EPMC9728836 | biostudies-literature
| S-EPMC6080076 | biostudies-literature
| S-EPMC9930806 | biostudies-literature
| S-EPMC7929752 | biostudies-literature
| S-EPMC10309999 | biostudies-literature
| S-EPMC6324323 | biostudies-literature
| S-EPMC7053021 | biostudies-literature
| S-EPMC8931647 | biostudies-literature
| S-EPMC9344797 | biostudies-literature