Unknown

Dataset Information

0

A Statistical Framework for the Analysis of ChIP-Seq Data.


ABSTRACT: Chromatin immunoprecipitation followed by sequencing (ChIP-Seq) has revolutionalized experiments for genome-wide profiling of DNA-binding proteins, histone modifications, and nucleosome occupancy. As the cost of sequencing is decreasing, many researchers are switching from microarray-based technologies (ChIP-chip) to ChIP-Seq for genome-wide study of transcriptional regulation. Despite its increasing and well-deserved popularity, there is little work that investigates and accounts for sources of biases in the ChIP-Seq technology. These biases typically arise from both the standard pre-processing protocol and the underlying DNA sequence of the generated data. We study data from a naked DNA sequencing experiment, which sequences non-cross-linked DNA after deproteinizing and shearing, to understand factors affecting background distribution of data generated in a ChIP-Seq experiment. We introduce a background model that accounts for apparent sources of biases such as mappability and GC content and develop a flexible mixture model named MOSAiCS for detecting peaks in both one- and two-sample analyses of ChIP-Seq data. We illustrate that our model fits observed ChIP-Seq data well and further demonstrate advantages of MOSAiCS over commonly used tools for ChIP-Seq data analysis with several case studies.

SUBMITTER: Kuan PF 

PROVIDER: S-EPMC4608541 | biostudies-literature | 2011

REPOSITORIES: biostudies-literature

altmetric image

Publications

A Statistical Framework for the Analysis of ChIP-Seq Data.

Kuan Pei Fen PF   Chung Dongjun D   Pan Guangjin G   Thomson James A JA   Stewart Ron R   Keleş Sündüz S  

Journal of the American Statistical Association 20110101 495


Chromatin immunoprecipitation followed by sequencing (ChIP-Seq) has revolutionalized experiments for genome-wide profiling of DNA-binding proteins, histone modifications, and nucleosome occupancy. As the cost of sequencing is decreasing, many researchers are switching from microarray-based technologies (ChIP-chip) to ChIP-Seq for genome-wide study of transcriptional regulation. Despite its increasing and well-deserved popularity, there is little work that investigates and accounts for sources of  ...[more]

Similar Datasets

| S-EPMC3957067 | biostudies-literature
| S-EPMC3717085 | biostudies-other
| S-EPMC3218220 | biostudies-literature
| S-EPMC3658457 | biostudies-literature
| S-EPMC3053263 | biostudies-literature
| S-EPMC3188793 | biostudies-literature
| S-EPMC3931556 | biostudies-literature
| S-EPMC2732365 | biostudies-literature
| S-EPMC5499643 | biostudies-literature
| S-EPMC3828144 | biostudies-literature