Dataset Information

Identifying peaks in *-seq data using shape information.


ABSTRACT: Peak calling is a fundamental step in the analysis of data generated by ChIP-seq or similar techniques to acquire epigenetics information. Current peak callers are often hard to parameterise and may therefore be difficult to use for non-bioinformaticians. In this paper, we present the ChIP-seq analysis tool available in CLC Genomics Workbench and CLC Genomics Server (version 7.5 and up), a user-friendly peak caller designed not to be specific to a particular *-seq protocol. We illustrate the advantages of a shape-based approach and describe the algorithmic principles underlying the implementation. Thanks to the generality of the idea and the fact that the algorithm is able to learn the peak shape from the data, the implementation requires only minimal user input, while still being applicable to a range of *-seq protocols. Using independently validated benchmark datasets, we compare our implementation to other state-of-the-art algorithms explicitly designed to analyse ChIP-seq data and provide an evaluation in terms of receiver operating characteristic (ROC) plots. In order to show the applicability of the method to similar *-seq protocols, we also investigate algorithmic performance on DNase-seq data. The results show that the CLC shape-based peak caller ranks well among popular state-of-the-art peak callers while providing flexibility and ease of use.

SUBMITTER: Strino F 

PROVIDER: S-EPMC4905608 | biostudies-literature | 2016 Jun

REPOSITORIES: biostudies-literature

Publications

Identifying peaks in *-seq data using shape information.

Strino F, Lappe M

BMC Bioinformatics, 2016-06-06



Similar Datasets

S-EPMC3120977 | biostudies-literature
S-EPMC3946423 | biostudies-literature
S-EPMC4806451 | biostudies-literature
S-EPMC4561497 | biostudies-literature
S-EPMC3870998 | biostudies-literature
S-EPMC4066778 | biostudies-literature
S-EPMC8532352 | biostudies-literature
S-EPMC5675649 | biostudies-literature
S-EPMC5001242 | biostudies-literature
S-EPMC3868217 | biostudies-literature