Unknown

Dataset Information

0

Improved detection of epigenomic marks with mixed-effects hidden Markov models.


ABSTRACT: Chromatin immunoprecipitation followed by next-generation sequencing (ChIP-seq) is a technique to detect genomic regions containing protein-DNA interaction, such as transcription factor binding sites or regions containing histone modifications. One goal of the analysis of ChIP-seq experiments is to identify genomic loci enriched for sequencing reads pertaining to DNA bound to the factor of interest. The accurate identification of such regions aids in the understanding of epigenomic marks and gene regulatory mechanisms. Given the reduction of massively parallel sequencing costs, methods to detect consensus regions of enrichment across multiple samples are of interest. Here, we present a statistical model to detect broad consensus regions of enrichment from ChIP-seq technical or biological replicates through a class of zero-inflated mixed-effects hidden Markov models. We show that the proposed model outperforms existing methods for consensus peak calling in common epigenomic marks by accounting for the excess zeros and sample-specific biases. We apply our method to data from the Encyclopedia of DNA Elements and Roadmap Epigenomics projects and also from an extensive simulation study.

SUBMITTER: Baldoni PL 

PROVIDER: S-EPMC6851437 | biostudies-literature | 2019 Dec

REPOSITORIES: biostudies-literature

altmetric image

Publications

Improved detection of epigenomic marks with mixed-effects hidden Markov models.

Baldoni Pedro L PL   Rashid Naim U NU   Ibrahim Joseph G JG  

Biometrics 20191017 4


Chromatin immunoprecipitation followed by next-generation sequencing (ChIP-seq) is a technique to detect genomic regions containing protein-DNA interaction, such as transcription factor binding sites or regions containing histone modifications. One goal of the analysis of ChIP-seq experiments is to identify genomic loci enriched for sequencing reads pertaining to DNA bound to the factor of interest. The accurate identification of such regions aids in the understanding of epigenomic marks and gen  ...[more]

Similar Datasets

| S-EPMC2770071 | biostudies-literature
| S-EPMC5695931 | biostudies-literature
| S-EPMC2998602 | biostudies-literature
| S-EPMC6868114 | biostudies-literature
| S-EPMC6504179 | biostudies-literature
| S-EPMC4009397 | biostudies-literature
| S-EPMC3563974 | biostudies-literature
| S-EPMC1636350 | biostudies-literature
| S-EPMC2857806 | biostudies-other
2012-10-18 | GSE34490 | GEO