Unknown

Dataset Information

0

Phenotype-optimized sequence ensembles substantially improve prediction of disease-causing mutation in cystic fibrosis.


ABSTRACT: Cystic fibrosis transmembrane conductance regulator (CFTR) mutation is associated with a phenotypic spectrum that includes cystic fibrosis (CF). The disease liability of some common CFTR mutations is known, but rare mutations are seen in too few patients to categorize unequivocally, making genetic diagnosis difficult. Computational methods can predict the impact of mutation, but prediction specificity is often below that required for clinical utility. Here, we present a novel supervised learning approach for predicting CF from CFTR missense mutation. The algorithm begins by constructing custom multiple sequence alignments called phenotype-optimized sequence ensembles (POSEs). POSEs are constructed iteratively, by selecting sequences that optimize predictive performance on a training set of CFTR mutations of known clinical significance. Next, we predict CF disease liability from a different set of CFTR mutations (test-set mutations). This approach achieves improved prediction performance relative to popular methods recently assessed using the same test-set mutations. Of clinical significance, our method achieves 94% prediction specificity. Because databases such as HGMD and locus-specific mutation databases are growing rapidly, methods that automatically tailor their predictions for a specific phenotype may be of immediate utility. If the performance achieved here generalizes to other systems, the approach could be an excellent tool to help establish genetic diagnoses.

SUBMITTER: Masica DL 

PROVIDER: S-EPMC4364283 | biostudies-literature | 2012 Aug

REPOSITORIES: biostudies-literature

altmetric image

Publications

Phenotype-optimized sequence ensembles substantially improve prediction of disease-causing mutation in cystic fibrosis.

Masica David L DL   Sosnay Patrick R PR   Cutting Garry R GR   Karchin Rachel R  

Human mutation 20120522 8


Cystic fibrosis transmembrane conductance regulator (CFTR) mutation is associated with a phenotypic spectrum that includes cystic fibrosis (CF). The disease liability of some common CFTR mutations is known, but rare mutations are seen in too few patients to categorize unequivocally, making genetic diagnosis difficult. Computational methods can predict the impact of mutation, but prediction specificity is often below that required for clinical utility. Here, we present a novel supervised learning  ...[more]

Similar Datasets

| S-EPMC2975206 | biostudies-literature
| S-EPMC6996396 | biostudies-literature
| 2726863 | ecrin-mdr-crc
| S-EPMC2945989 | biostudies-literature
| S-EPMC4789841 | biostudies-literature
| S-EPMC4582929 | biostudies-literature
2007-05-21 | GSE5715 | GEO
| S-EPMC3667188 | biostudies-literature
| S-EPMC5836842 | biostudies-literature
| S-EPMC6339265 | biostudies-literature