Unknown

Dataset Information

0

Regulatory Single-Nucleotide Variant Predictor Increases Predictive Performance of Functional Regulatory Variants.


ABSTRACT: In silico methods for detecting functionally relevant genetic variants are important for identifying genetic markers of human inherited disease. Much research has focused on protein-coding variants since coding regions have well-defined physicochemical and functional properties. However, many bioinformatics tools are not applicable to variants outside coding regions. Here, we increase the classification performance of our regulatory single-nucleotide variant predictor (RSVP) for variants that cause regulatory abnormalities from an AUC of 0.90-0.97 by incorporating genomic regions identified by the ENCODE project into RSVP. RSVP is comparable to a recently published tool, Genome-Wide Annotation of Variants (GWAVA); both RSVP and GWAVA perform better on regulatory variants than a traditional variant predictor, combined annotation-dependent depletion (CADD). However, our method outperforms GWAVA on variants located at similar distances to the transcription start site as the positive set (AUC: 0.96) as compared with GWAVA (AUC: 0.71). Much of this disparity is due to RSVP's incorporation of features pertaining to the nearest gene (expression, GO terms, etc.), which are not included in GWAVA. Our findings hold out the promise of a framework for the assessment of all functional regulatory variants, providing a means to predict which rare or de novo variants are of pathogenic significance.

SUBMITTER: Peterson TA 

PROVIDER: S-EPMC6192032 | biostudies-literature | 2016 Nov

REPOSITORIES: biostudies-literature

altmetric image

Publications

Regulatory Single-Nucleotide Variant Predictor Increases Predictive Performance of Functional Regulatory Variants.

Peterson Thomas A TA   Mort Matthew M   Cooper David N DN   Radivojac Predrag P   Kann Maricel G MG   Mooney Sean D SD  

Human mutation 20160831 11


In silico methods for detecting functionally relevant genetic variants are important for identifying genetic markers of human inherited disease. Much research has focused on protein-coding variants since coding regions have well-defined physicochemical and functional properties. However, many bioinformatics tools are not applicable to variants outside coding regions. Here, we increase the classification performance of our regulatory single-nucleotide variant predictor (RSVP) for variants that ca  ...[more]

Similar Datasets

| S-EPMC5707065 | biostudies-literature
| S-EPMC6301329 | biostudies-literature
2024-02-08 | GSE255117 | GEO
| S-EPMC6460560 | biostudies-literature
| S-EPMC4887298 | biostudies-literature
| S-EPMC3522194 | biostudies-literature