Unknown

Dataset Information

0

Incremental data integration for tracking genotype-disease associations.


ABSTRACT: Functional annotation of genes remains a challenge in fundamental biology and is a limiting factor for translational medicine. Computational approaches have been developed to process heterogeneous data into meaningful metrics, but often do not address how findings might be updated when new evidence comes to light. To address this challenge, we describe requirements for a framework for incremental data integration and propose an implementation based on phenotype ontologies and Bayesian probability updates. We apply the framework to quantify similarities between gene annotations and disease profiles. Within this scope, we categorize human diseases according to how well they can be recapitulated by animal models and quantify similarities between human diseases and mouse models produced by the International Mouse Phenotyping Consortium. The flexibility of the approach allows us to incorporate negative phenotypic data to better prioritize candidate genes, and to stratify disease mapping using sex-dependent phenotypes. All our association scores can be updated and we exploit this feature to showcase integration with curated annotations from high-precision assays. Incremental integration is thus a suitable framework for tracking functional annotations and linking to complex human pathology.

SUBMITTER: Konopka T 

PROVIDER: S-EPMC7004389 | biostudies-literature | 2020 Jan

REPOSITORIES: biostudies-literature

altmetric image

Publications

Incremental data integration for tracking genotype-disease associations.

Konopka Tomasz T   Smedley Damian D  

PLoS computational biology 20200127 1


Functional annotation of genes remains a challenge in fundamental biology and is a limiting factor for translational medicine. Computational approaches have been developed to process heterogeneous data into meaningful metrics, but often do not address how findings might be updated when new evidence comes to light. To address this challenge, we describe requirements for a framework for incremental data integration and propose an implementation based on phenotype ontologies and Bayesian probabilit  ...[more]

Similar Datasets

| S-EPMC1630430 | biostudies-literature
| S-EPMC6954643 | biostudies-literature
| S-EPMC5977601 | biostudies-literature
| S-EPMC4743935 | biostudies-literature
| S-EPMC3980383 | biostudies-literature
| S-EPMC4313847 | biostudies-literature
| S-EPMC9216524 | biostudies-literature
| S-EPMC8741210 | biostudies-literature
| S-EPMC5546495 | biostudies-other
| S-EPMC10712715 | biostudies-literature