Unknown

Dataset Information

0

GERV: a statistical method for generative evaluation of regulatory variants for transcription factor binding.


ABSTRACT: The majority of disease-associated variants identified in genome-wide association studies reside in noncoding regions of the genome with regulatory roles. Thus being able to interpret the functional consequence of a variant is essential for identifying causal variants in the analysis of genome-wide association studies.We present GERV (generative evaluation of regulatory variants), a novel computational method for predicting regulatory variants that affect transcription factor binding. GERV learns a k-mer-based generative model of transcription factor binding from ChIP-seq and DNase-seq data, and scores variants by computing the change of predicted ChIP-seq reads between the reference and alternate allele. The k-mers learned by GERV capture more sequence determinants of transcription factor binding than a motif-based approach alone, including both a transcription factor's canonical motif and associated co-factor motifs. We show that GERV outperforms existing methods in predicting single-nucleotide polymorphisms associated with allele-specific binding. GERV correctly predicts a validated causal variant among linked single-nucleotide polymorphisms and prioritizes the variants previously reported to modulate the binding of FOXA1 in breast cancer cell lines. Thus, GERV provides a powerful approach for functionally annotating and prioritizing causal variants for experimental follow-up analysis.The implementation of GERV and related data are available at http://gerv.csail.mit.edu/.

SUBMITTER: Zeng H 

PROVIDER: S-EPMC5860000 | biostudies-literature | 2016 Feb

REPOSITORIES: biostudies-literature

altmetric image

Publications

GERV: a statistical method for generative evaluation of regulatory variants for transcription factor binding.

Zeng Haoyang H   Hashimoto Tatsunori T   Kang Daniel D DD   Gifford David K DK  

Bioinformatics (Oxford, England) 20151017 4


<h4>Motivation</h4>The majority of disease-associated variants identified in genome-wide association studies reside in noncoding regions of the genome with regulatory roles. Thus being able to interpret the functional consequence of a variant is essential for identifying causal variants in the analysis of genome-wide association studies.<h4>Results</h4>We present GERV (generative evaluation of regulatory variants), a novel computational method for predicting regulatory variants that affect trans  ...[more]

Similar Datasets

| S-EPMC2266803 | biostudies-literature
| S-EPMC2241927 | biostudies-literature
| S-EPMC6677801 | biostudies-literature
| S-EPMC2800119 | biostudies-literature
| S-EPMC6662294 | biostudies-literature
2019-06-04 | GSE117576 | GEO
| S-EPMC2638147 | biostudies-literature
| S-EPMC4643619 | biostudies-literature
| S-EPMC6906655 | biostudies-literature
2019-06-04 | GSE117565 | GEO