Unknown

Dataset Information

0

Picking ChIP-seq peak detectors for analyzing chromatin modification experiments.


ABSTRACT: Numerous algorithms have been developed to analyze ChIP-Seq data. However, the complexity of analyzing diverse patterns of ChIP-Seq signals, especially for epigenetic marks, still calls for the development of new algorithms and objective comparisons of existing methods. We developed Qeseq, an algorithm to detect regions of increased ChIP read density relative to background. Qeseq employs critical novel elements, such as iterative recalibration and neighbor joining of reads to identify enriched regions of any length. To objectively assess its performance relative to other 14 ChIP-Seq peak finders, we designed a novel protocol based on Validation Discriminant Analysis (VDA) to optimally select validation sites and generated two validation datasets, which are the most comprehensive to date for algorithmic benchmarking of key epigenetic marks. In addition, we systematically explored a total of 315 diverse parameter configurations from these algorithms and found that typically optimal parameters in one dataset do not generalize to other datasets. Nevertheless, default parameters show the most stable performance, suggesting that they should be used. This study also provides a reproducible and generalizable methodology for unbiased comparative analysis of high-throughput sequencing tools that can facilitate future algorithmic development.

SUBMITTER: Micsinai M 

PROVIDER: S-EPMC3351193 | biostudies-literature | 2012 May

REPOSITORIES: biostudies-literature

altmetric image

Publications

Picking ChIP-seq peak detectors for analyzing chromatin modification experiments.

Micsinai Mariann M   Parisi Fabio F   Strino Francesco F   Asp Patrik P   Dynlacht Brian D BD   Kluger Yuval Y  

Nucleic acids research 20120203 9


Numerous algorithms have been developed to analyze ChIP-Seq data. However, the complexity of analyzing diverse patterns of ChIP-Seq signals, especially for epigenetic marks, still calls for the development of new algorithms and objective comparisons of existing methods. We developed Qeseq, an algorithm to detect regions of increased ChIP read density relative to background. Qeseq employs critical novel elements, such as iterative recalibration and neighbor joining of reads to identify enriched r  ...[more]

Similar Datasets

| S-EPMC5408812 | biostudies-literature
| S-EPMC6547432 | biostudies-literature
| S-EPMC3032669 | biostudies-literature
| S-EPMC4345404 | biostudies-literature
| S-EPMC3677880 | biostudies-literature
| S-EPMC8253552 | biostudies-literature
| S-EPMC3045301 | biostudies-literature
| S-EPMC2596672 | biostudies-literature
| S-EPMC2900203 | biostudies-other
| S-EPMC3672025 | biostudies-literature