Dataset Information

Knowledge discovery in variant databases using inductive logic programming.


ABSTRACT: Understanding the effects of genetic variation on the phenotype of an individual is a major goal of biomedical research, especially for the development of diagnostics and effective therapeutic solutions. In this work, we describe the use of a recent knowledge discovery in databases (KDD) approach using inductive logic programming (ILP) to automatically extract knowledge about human monogenic diseases. We extracted background knowledge from MSV3d, a database of all human missense variants mapped to 3D protein structure. In this study, we identified 8,117 mutations in 805 proteins with known three-dimensional structures that are known to be involved in human monogenic disease. Our results help to improve our understanding of the relationships between structural, functional or evolutionary features and deleterious mutations. Our inferred rules can also be applied to predict the impact of any single amino acid replacement on the function of a protein. The interpretable rules are available at http://decrypthon.igbmc.fr/kd4v/.

SUBMITTER: Nguyen H 

PROVIDER: S-EPMC3615990 | biostudies-literature | 2013

REPOSITORIES: biostudies-literature

Publications

Knowledge discovery in variant databases using inductive logic programming.

Hoan Nguyen, Tien-Dao Luu, Olivier Poch, Julie D. Thompson

Bioinformatics and Biology Insights, 2013-03-18



Similar Datasets

S-EPMC4190110 | biostudies-literature
S-EPMC3913550 | biostudies-literature
S-EPMC3458898 | biostudies-literature
S-EPMC3078102 | biostudies-literature
S-EPMC7272073 | biostudies-literature
GSE239996 | GEO | 2023-08-15
S-EPMC3394327 | biostudies-literature
S-EPMC3753570 | biostudies-literature
S-EPMC4016706 | biostudies-literature
S-EPMC549560 | biostudies-literature