Unknown

Dataset Information

0

Learning from Longitudinal Data in Electronic Health Record and Genetic Data to Improve Cardiovascular Event Prediction.


ABSTRACT: Current approaches to predicting a cardiovascular disease (CVD) event rely on conventional risk factors and cross-sectional data. In this study, we applied machine learning and deep learning models to 10-year CVD event prediction by using longitudinal electronic health record (EHR) and genetic data. Our study cohort included 109, 490 individuals. In the first experiment, we extracted aggregated and longitudinal features from EHR. We applied logistic regression, random forests, gradient boosting trees, convolutional neural networks (CNN) and recurrent neural networks with long short-term memory (LSTM) units. In the second experiment, we applied a late-fusion approach to incorporate genetic features. We compared the performance with approaches currently utilized in routine clinical practice - American College of Cardiology and the American Heart Association (ACC/AHA) Pooled Cohort Risk Equation. Our results indicated that incorporating longitudinal feature lead to better event prediction. Combining genetic features through a late-fusion approach can further improve CVD prediction, underscoring the importance of integrating relevant genetic data whenever available.

SUBMITTER: Zhao J 

PROVIDER: S-EPMC6345960 | biostudies-literature | 2019 Jan

REPOSITORIES: biostudies-literature

altmetric image

Publications

Learning from Longitudinal Data in Electronic Health Record and Genetic Data to Improve Cardiovascular Event Prediction.

Zhao Juan J   Feng QiPing Q   Wu Patrick P   Lupu Roxana A RA   Wilke Russell A RA   Wells Quinn S QS   Denny Joshua C JC   Wei Wei-Qi WQ  

Scientific reports 20190124 1


Current approaches to predicting a cardiovascular disease (CVD) event rely on conventional risk factors and cross-sectional data. In this study, we applied machine learning and deep learning models to 10-year CVD event prediction by using longitudinal electronic health record (EHR) and genetic data. Our study cohort included 109, 490 individuals. In the first experiment, we extracted aggregated and longitudinal features from EHR. We applied logistic regression, random forests, gradient boosting  ...[more]

Similar Datasets

| S-EPMC4081533 | biostudies-literature
| S-EPMC9779795 | biostudies-literature
| S-EPMC8722098 | biostudies-literature
| S-EPMC5380334 | biostudies-literature
| S-EPMC7532582 | biostudies-literature
| S-EPMC6080076 | biostudies-literature
| S-EPMC9834152 | biostudies-literature
| S-EPMC10844815 | biostudies-literature
| S-EPMC8411414 | biostudies-literature
| S-EPMC9106858 | biostudies-literature