Unknown

Dataset Information

0

Large-Scale Discovery of Disease-Disease and Disease-Gene Associations.


ABSTRACT: Data-driven phenotype analyses on Electronic Health Record (EHR) data have recently drawn benefits across many areas of clinical practice, uncovering new links in the medical sciences that can potentially affect the well-being of millions of patients. In this paper, EHR data is used to discover novel relationships between diseases by studying their comorbidities (co-occurrences in patients). A novel embedding model is designed to extract knowledge from disease comorbidities by learning from a large-scale EHR database comprising more than 35 million inpatient cases spanning nearly a decade, revealing significant improvements on disease phenotyping over current computational approaches. In addition, the use of the proposed methodology is extended to discover novel disease-gene associations by including valuable domain knowledge from genome-wide association studies. To evaluate our approach, its effectiveness is compared against a held-out set where, again, it revealed very compelling results. For selected diseases, we further identify candidate gene lists for which disease-gene associations were not studied previously. Thus, our approach provides biomedical researchers with new tools to filter genes of interest, thus, reducing costly lab studies.

SUBMITTER: Gligorijevic D 

PROVIDER: S-EPMC5006166 | biostudies-literature | 2016 Aug

REPOSITORIES: biostudies-literature

altmetric image

Publications

Large-Scale Discovery of Disease-Disease and Disease-Gene Associations.

Gligorijevic Djordje D   Stojanovic Jelena J   Djuric Nemanja N   Radosavljevic Vladan V   Grbovic Mihajlo M   Kulathinal Rob J RJ   Obradovic Zoran Z  

Scientific reports 20160831


Data-driven phenotype analyses on Electronic Health Record (EHR) data have recently drawn benefits across many areas of clinical practice, uncovering new links in the medical sciences that can potentially affect the well-being of millions of patients. In this paper, EHR data is used to discover novel relationships between diseases by studying their comorbidities (co-occurrences in patients). A novel embedding model is designed to extract knowledge from disease comorbidities by learning from a la  ...[more]

Similar Datasets

| S-EPMC7038619 | biostudies-literature
| S-EPMC10567571 | biostudies-literature
| S-EPMC1557754 | biostudies-literature
| S-EPMC1401514 | biostudies-literature
| S-EPMC5357838 | biostudies-literature
| S-EPMC4576452 | biostudies-literature
| S-EPMC6671969 | biostudies-literature
2011-12-04 | E-GEOD-32587 | biostudies-arrayexpress
2011-12-04 | GSE32587 | GEO
| S-EPMC2745341 | biostudies-literature