Dataset Information

Interpretation of personal genome sequencing data in terms of disease ranks based on mutual information.

ABSTRACT: The rapid advances in genome sequencing technologies have resulted in an unprecedented number of genome variations being discovered in humans. However, there has been very limited coverage of interpretation of the personal genome sequencing data in terms of diseases. In this paper we present the first computational analysis scheme for interpreting personal genome data by simultaneously considering the functional impact of damaging variants and curated disease-gene association data. This method is based on mutual information as a measure of the relative closeness between the personal genome and diseases. We hypothesize that a higher mutual information score implies that the personal genome is more susceptible to a particular disease than other diseases. The method was applied to the sequencing data of 50 acute myeloid leukemia (AML) patients in The Cancer Genome Atlas. The utility of associations between a disease and the personal genome was explored using data of healthy (control) people obtained from the 1000 Genomes Project. The ranks of the disease terms in the AML patient group were compared with those in the healthy control group using "Leukemia, Myeloid, Acute" (C04.557.337.539.550) as the corresponding MeSH disease term. Overall, the area under the receiver operating characteristics curve was significantly larger for the AML patient data than for the healthy controls. This methodology could contribute to consequential discoveries and explanations for mining personal genome sequencing data in terms of diseases, and have versatility with respect to genomic-based knowledge such as drug-gene and environmental-factor-gene interactions.
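
The abstract describes ranking diseases by a mutual information score between a personal genome and curated disease-gene associations, but this record does not give the exact formulation. The following is only a minimal sketch under stated assumptions: each gene in a fixed gene universe carries two binary indicators (damaged in the genome, associated with the disease), and the score is the standard mutual information between those two indicators. The function names, the gene labels and the toy data are hypothetical and are not taken from the paper.

```python
import math


def mutual_information(damaged, disease_genes, universe):
    """Mutual information (in bits) between two binary gene indicators.

    ASSUMPTION: this is the textbook definition over a gene universe,
    not necessarily the scoring scheme used in the paper.
    """
    n = len(universe)
    # Joint counts of (damaged?, disease-associated?) over all genes.
    counts = {(a, b): 0 for a in (0, 1) for b in (0, 1)}
    for gene in universe:
        counts[(int(gene in damaged), int(gene in disease_genes))] += 1

    mi = 0.0
    for (a, b), c in counts.items():
        if c == 0:
            continue  # a zero-probability cell contributes nothing
        p_ab = c / n
        p_a = (counts[(a, 0)] + counts[(a, 1)]) / n
        p_b = (counts[(0, b)] + counts[(1, b)]) / n
        mi += p_ab * math.log2(p_ab / (p_a * p_b))
    return mi


def rank_diseases(damaged, disease_to_genes, universe):
    """Return disease names sorted by descending mutual information."""
    scores = {
        disease: mutual_information(damaged, genes, universe)
        for disease, genes in disease_to_genes.items()
    }
    return sorted(scores, key=scores.get, reverse=True)


if __name__ == "__main__":
    # Hypothetical toy data: eight placeholder genes, two placeholder diseases.
    universe = {f"G{i}" for i in range(1, 9)}
    damaged = {"G1", "G2", "G3", "G4"}
    disease_to_genes = {
        "disease_A": {"G1", "G2", "G3", "G4"},
        "disease_B": {"G1", "G2", "G5", "G6"},
    }
    print(rank_diseases(damaged, disease_to_genes, universe))
```

Mutual information is symmetric with respect to enrichment and depletion, so a practical scheme would also need to check the direction of the association; that step is omitted here.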

SUBMITTER: Na YJ

PROVIDER: S-EPMC4460593 | biostudies-literature | 2015

REPOSITORIES: biostudies-literature

Publications

Interpretation of personal genome sequencing data in terms of disease ranks based on mutual information.

Na YJ (Young-Ji Na), Sohn KA (Kyung-Ah Sohn), Kim JH (Ju Han Kim)

BMC Medical Genomics, 2015-05-29

PMID: 26045178

Similar Datasets

Reconstruction of the personal information from human genome reads in gut metagenome sequencing data. | S-EPMC10234815 | biostudies-literature

Co-localization between Sequence Constraint and Epigenomic Information Improves Interpretation of Whole-Genome Sequencing Data. | S-EPMC7118583 | biostudies-literature

SpeedSeq: ultra-fast personal genome analysis and interpretation. | S-EPMC4589466 | biostudies-literature

Mutual information between discrete and continuous data sets. | S-EPMC3929353 | biostudies-literature

Mutual information: Measuring nonlinear dependence in longitudinal epidemiological data. | S-EPMC10132663 | biostudies-literature

MIAMI: Mutual Information-based Analysis of Multiplex Imaging data. | S-EPMC9344855 | biostudies-literature

Personal Cancer Genome Reporter: variant interpretation report for precision oncology. | S-EPMC5946881 | biostudies-literature

SNPedia: a wiki supporting personal genome annotation, interpretation and analysis. | S-EPMC3245045 | biostudies-literature

Genome-wide epistasis and co-selection study using mutual information. | S-EPMC6765119 | biostudies-literature

Interpretome: a freely available, modular, and secure personal genome interpretation engine. | S-EPMC4809242 | biostudies-literature