Unknown

Dataset Information

0

Xrare: a machine learning method jointly modeling phenotypes and genetic evidence for rare disease diagnosis.


ABSTRACT:

Purpose

Despite the successful progress next-generation sequencing technologies has achieved in diagnosing the genetic cause of rare Mendelian diseases, the current diagnostic rate is still far from satisfactory because of heterogeneity, imprecision, and noise in disease phenotype descriptions and insufficient utilization of expert knowledge in clinical genetics. To overcome these difficulties, we present a novel method called Xrare for the prioritization of causative gene variants in rare disease diagnosis.

Methods

We propose a new phenotype similarity scoring method called Emission-Reception Information Content (ERIC), which is highly tolerant of noise and imprecision in clinical phenotypes. We utilize medical genetic domain knowledge by designing genetic features implementing American College of Medical Genetics and Genomics (ACMG) guidelines.

Results

ERIC score ranked consistently higher for disease genes than other phenotypic similarity scores in the presence of imprecise and noisy phenotypes. Extensive simulations and real clinical data demonstrated that Xrare outperforms existing alternative methods by 10-40% at various genetic diagnosis scenarios.

Conclusion

The Xrare model is learned from a large database of clinical variants, and derives its strength from the tight integration of medical genetics features and phenotypic features similarity scores. Xrare provides the clinical community with a robust and powerful tool for variant prioritization.

SUBMITTER: Li Q 

PROVIDER: S-EPMC6752318 | biostudies-literature | 2019 Sep

REPOSITORIES: biostudies-literature

altmetric image

Publications

Xrare: a machine learning method jointly modeling phenotypes and genetic evidence for rare disease diagnosis.

Li Qigang Q   Zhao Keyan K   Bustamante Carlos D CD   Ma Xin X   Wong Wing H WH  

Genetics in medicine : official journal of the American College of Medical Genetics 20190124 9


<h4>Purpose</h4>Despite the successful progress next-generation sequencing technologies has achieved in diagnosing the genetic cause of rare Mendelian diseases, the current diagnostic rate is still far from satisfactory because of heterogeneity, imprecision, and noise in disease phenotype descriptions and insufficient utilization of expert knowledge in clinical genetics. To overcome these difficulties, we present a novel method called Xrare for the prioritization of causative gene variants in ra  ...[more]

Similar Datasets

| S-EPMC7122481 | biostudies-literature
| S-EPMC3572743 | biostudies-literature
| S-EPMC6288202 | biostudies-literature
| S-EPMC7319603 | biostudies-literature
| S-EPMC9579891 | biostudies-literature
| S-EPMC2954823 | biostudies-literature
| S-EPMC10322212 | biostudies-literature
| S-EPMC4871978 | biostudies-other
| S-EPMC6500604 | biostudies-other
| S-EPMC8050752 | biostudies-literature