Unknown

Dataset Information

0

Broad-Enrich: functional interpretation of large sets of broad genomic regions.


ABSTRACT:

Motivation

Functional enrichment testing facilitates the interpretation of Chromatin immunoprecipitation followed by high-throughput sequencing (ChIP-seq) data in terms of pathways and other biological contexts. Previous methods developed and used to test for key gene sets affected in ChIP-seq experiments treat peaks as points, and are based on the number of peaks associated with a gene or a binary score for each gene. These approaches work well for transcription factors, but histone modifications often occur over broad domains, and across multiple genes.

Results

To incorporate the unique properties of broad domains into functional enrichment testing, we developed Broad-Enrich, a method that uses the proportion of each gene's locus covered by a peak. We show that our method has a well-calibrated false-positive rate, performing well with ChIP-seq data having broad domains compared with alternative approaches. We illustrate Broad-Enrich with 55 ENCODE ChIP-seq datasets using different methods to define gene loci. Broad-Enrich can also be applied to other datasets consisting of broad genomic domains such as copy number variations.

Availability and implementation

http://broad-enrich.med.umich.edu for Web version and R package.

Supplementary information

Supplementary data are available at Bioinformatics online.

SUBMITTER: Cavalcante RG 

PROVIDER: S-EPMC4147897 | biostudies-literature | 2014 Sep

REPOSITORIES: biostudies-literature

altmetric image

Publications

Broad-Enrich: functional interpretation of large sets of broad genomic regions.

Cavalcante Raymond G RG   Lee Chee C   Welch Ryan P RP   Patil Snehal S   Weymouth Terry T   Scott Laura J LJ   Sartor Maureen A MA  

Bioinformatics (Oxford, England) 20140901 17


<h4>Motivation</h4>Functional enrichment testing facilitates the interpretation of Chromatin immunoprecipitation followed by high-throughput sequencing (ChIP-seq) data in terms of pathways and other biological contexts. Previous methods developed and used to test for key gene sets affected in ChIP-seq experiments treat peaks as points, and are based on the number of peaks associated with a gene or a binary score for each gene. These approaches work well for transcription factors, but histone mod  ...[more]

Similar Datasets

| S-EPMC3245020 | biostudies-literature
| S-EPMC2375126 | biostudies-literature
| S-EPMC7003681 | biostudies-literature
| S-EPMC4840234 | biostudies-literature
| S-EPMC4617659 | biostudies-literature
| S-EPMC6886512 | biostudies-literature
| S-EPMC5838936 | biostudies-literature
| S-EPMC4214737 | biostudies-literature
| S-EPMC1087789 | biostudies-literature
| S-EPMC4695051 | biostudies-literature