Unknown

Dataset Information

0

Sequence-based prioritization of nonsynonymous single-nucleotide polymorphisms for the study of disease mutations.


ABSTRACT: The increasing demand for the identification of genetic variation responsible for common diseases has translated into a need for sophisticated methods for effectively prioritizing mutations occurring in disease-associated genetic regions. In this article, we prioritize candidate nonsynonymous single-nucleotide polymorphisms (nsSNPs) through a bioinformatics approach that takes advantages of a set of improved numeric features derived from protein-sequence information and a new statistical learning model called "multiple selection rule voting" (MSRV). The sequence-based features can maximize the scope of applications of our approach, and the MSRV model can capture subtle characteristics of individual mutations. Systematic validation of the approach demonstrates that this approach is capable of prioritizing causal mutations for both simple monogenic diseases and complex polygenic diseases. Further studies of familial Alzheimer diseases and diabetes show that the approach can enrich mutations underlying these polygenic diseases among the top of candidate mutations. Application of this approach to unclassified mutations suggests that there are 10 suspicious mutations likely to cause diseases, and there is strong support for this in the literature.

SUBMITTER: Jiang R 

PROVIDER: S-EPMC1950793 | biostudies-literature | 2007 Aug

REPOSITORIES: biostudies-literature

altmetric image

Publications

Sequence-based prioritization of nonsynonymous single-nucleotide polymorphisms for the study of disease mutations.

Jiang Rui R   Yang Hua H   Zhou Linqi L   Kuo C-C Jay CC   Sun Fengzhu F   Chen Ting T  

American journal of human genetics 20070622 2


The increasing demand for the identification of genetic variation responsible for common diseases has translated into a need for sophisticated methods for effectively prioritizing mutations occurring in disease-associated genetic regions. In this article, we prioritize candidate nonsynonymous single-nucleotide polymorphisms (nsSNPs) through a bioinformatics approach that takes advantages of a set of improved numeric features derived from protein-sequence information and a new statistical learnin  ...[more]

Similar Datasets

| S-EPMC4592972 | biostudies-literature
| S-EPMC3399991 | biostudies-literature
| S-EPMC4420795 | biostudies-literature
| S-EPMC4061446 | biostudies-literature
| S-EPMC3827885 | biostudies-other
| S-EPMC6174354 | biostudies-literature