Dataset Information

OccuPeak: ChIP-Seq peak calling based on internal background modelling.

ABSTRACT:

Unlabelled

ChIP-seq has become a major tool for the genome-wide identification of transcription factor binding or histone modification sites. Most peak-calling algorithms require input control datasets to model the occurrence of background reads to account for local sequencing and GC bias. However, the GC-content of reads in Input-seq datasets deviates significantly from that in ChIP-seq datasets. Moreover, we observed that a commonly used peak calling program performed equally well when the use of a simulated uniform background set was compared to an Input-seq dataset. This contradicts the assumption that input control datasets are necessary to fatefully reflect the background read distribution. Because the GC-content of the abundant single reads in ChIP-seq datasets is similar to those of randomly sampled regions we designed a peak-calling algorithm with a background model based on overlapping single reads. The application, OccuPeak, uses the abundant low frequency tags present in each ChIP-seq dataset to model the background, thereby avoiding the need for additional datasets. Analysis of the performance of OccuPeak showed robust model parameters. Its measure of peak significance, the excess ratio, is only dependent on the tag density of a peak and the global noise levels. Compared to the commonly used peak-calling applications MACS and CisGenome, OccuPeak had the highest sensitivity in an enhancer identification benchmark test, and performed similar in an overlap tests of transcription factor occupation with DNase I hypersensitive sites and H3K27ac sites. Moreover, peaks called by OccuPeak were significantly enriched with cardiac disease-associated SNPs. OccuPeak runs as a standalone application and does not require extensive tweaking of parameters, making its use straightforward and user friendly.

Availability

http://occupeak.hfrc.nl.

SUBMITTER: de Boer BA

PROVIDER: S-EPMC4061025 | biostudies-literature | 2014

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

OccuPeak: ChIP-Seq peak calling based on internal background modelling.

de Boer Bouke A BA van Duijvenboden Karel K van den Boogaard Malou M Christoffels Vincent M VM Barnett Phil P Ruijter Jan M JM

PloS one 20140617 6

<h4>Unlabelled</h4>ChIP-seq has become a major tool for the genome-wide identification of transcription factor binding or histone modification sites. Most peak-calling algorithms require input control datasets to model the occurrence of background reads to account for local sequencing and GC bias. However, the GC-content of reads in Input-seq datasets deviates significantly from that in ChIP-seq datasets. Moreover, we observed that a commonly used peak calling program performed equally well when ...[more]

PMID: 24936875

Dataset Information

OccuPeak: ChIP-Seq peak calling based on internal background modelling.

Unlabelled

Availability

Publications

OccuPeak: ChIP-Seq peak calling based on internal background modelling.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

Graph Peak Caller: Calling ChIP-seq peaks on graph-based reference genomes.
| S-EPMC6396939 | biostudies-literature

Features that define the best ChIP-seq peak calling algorithms.
| S-EPMC5429005 | biostudies-literature

WACS: improving ChIP-seq peak calling by optimally weighting controls.
| S-EPMC7885521 | biostudies-literature

NEXT-peak: a normal-exponential two-peak model for peak-calling in ChIP-seq data.
| S-EPMC3672025 | biostudies-literature

Differential peak calling of ChIP-seq signals with replicates with THOR.
| S-EPMC5175345 | biostudies-literature

Comparative analysis of commonly used peak calling programs for ChIP-Seq analysis.
| S-EPMC7808876 | biostudies-literature

Shape-based peak identification for ChIP-Seq.
| S-EPMC3032669 | biostudies-literature

PePr: a peak-calling prioritization pipeline to identify consistent or differential peaks from replicated ChIP-Seq data.
| S-EPMC4155259 | biostudies-literature

PeaKDEck: a kernel density estimator-based peak calling program for DNaseI-seq data.
| S-EPMC3998130 | biostudies-literature

Characterising ChIP-seq binding patterns by model-based peak shape deconvolution.
| S-EPMC4046686 | biostudies-literature