
Dataset Information


Pinpointing transcription factor binding sites from ChIP-seq data with SeqSite.


ABSTRACT:

Background

Chromatin immunoprecipitation combined with next-generation DNA sequencing (ChIP-seq) has become a key approach for detecting, genome-wide, the genomic sites bound by proteins such as transcription factors (TFs). Several methods and open-source tools have been developed to analyze ChIP-seq data. However, most of them are designed to detect TF binding regions rather than to accurately locate transcription factor binding sites (TFBSs). Pinpointing TFBSs directly from ChIP-seq data remains challenging, especially in regions with closely spaced binding events.

Results

To pinpoint TFBSs at high resolution, we propose a novel method named SeqSite, which implements a two-step strategy: it first detects tag-enriched regions and then pinpoints binding sites within the detected regions. The second step models the tag density profile, locates TFBSs on each strand with a least-squares model-fitting strategy, and merges the detections from the two strands. Experiments on simulated data show that SeqSite can locate most binding sites that are more than 40 bp apart. Applications to three human TF ChIP-seq datasets demonstrate that SeqSite pinpoints binding sites at a higher resolution than existing methods.
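The least-squares fitting step on a single strand can be illustrated with a minimal sketch. This is not SeqSite's actual model: the triangular tag-density shape, the greedy one-site-at-a-time search, and the names `triangular_kernel`, `fit_sites`, and `min_amplitude` are all assumptions introduced here for illustration only.

```python
def triangular_kernel(half_width):
    """Hypothetical tag-density shape around one binding site (an assumption)."""
    return [1.0 - abs(d) / (half_width + 1)
            for d in range(-half_width, half_width + 1)]


def fit_sites(density, kernel, min_amplitude, max_sites=10):
    """Greedy least-squares fit on one strand (illustrative, not SeqSite's code).

    Repeatedly place the kernel where it removes the most squared error from
    the residual signal, then subtract the fitted contribution.
    """
    residual = list(density)
    half = len(kernel) // 2
    sites = []
    for _ in range(max_sites):
        best = None  # (squared-error reduction, position, amplitude)
        for pos in range(len(residual)):
            num = 0.0
            den = 0.0
            for j, k in enumerate(kernel):
                i = pos - half + j
                if 0 <= i < len(residual):
                    num += residual[i] * k
                    den += k * k
            amp = num / den   # least-squares amplitude for this position
            gain = amp * num  # squared error removed by this placement
            if amp > 0 and (best is None or gain > best[0]):
                best = (gain, pos, amp)
        if best is None or best[2] < min_amplitude:
            break
        _, pos, amp = best
        for j, k in enumerate(kernel):
            i = pos - half + j
            if 0 <= i < len(residual):
                residual[i] -= amp * k
        sites.append((pos, amp))
    return sorted(sites)


# Synthetic single-strand profile with two made-up sites at 30 and 80.
kernel = triangular_kernel(10)
density = [0.0] * 120
for center, height in [(30, 5.0), (80, 3.0)]:
    for j, k in enumerate(kernel):
        density[center - 10 + j] += height * k

sites = fit_sites(density, kernel, min_amplitude=1.0)
```

In this toy setting `sites` holds one `(position, amplitude)` pair per fitted site. A full pipeline along the lines described above would run such a fit separately on each strand within every tag-enriched region and then merge the per-strand detections.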

Conclusions

We have developed a computational tool named SeqSite, which can pinpoint both closely spaced and isolated binding sites, and consequently improves the resolution of TFBS detection from ChIP-seq data.

SUBMITTER: Wang X 

PROVIDER: S-EPMC3287483 | biostudies-literature | 2011

REPOSITORIES: biostudies-literature


Publications

Pinpointing transcription factor binding sites from ChIP-seq data with SeqSite.

Wang Xi, Zhang Xuegong

BMC Systems Biology, 2011-12-14



Similar Datasets

| S-EPMC2917543 | biostudies-literature
| S-EPMC3799470 | biostudies-literature
| S-EPMC2853110 | biostudies-literature
| S-EPMC3245948 | biostudies-literature
| S-EPMC4082612 | biostudies-literature
| S-EPMC6199713 | biostudies-literature
| S-EPMC3798280 | biostudies-literature
| S-EPMC4234207 | biostudies-literature
| S-EPMC8158016 | biostudies-literature
| S-EPMC3946423 | biostudies-literature