Unknown

Dataset Information

0

A dynamic Bayesian network for identifying protein-binding footprints from single molecule-based sequencing data.


ABSTRACT:

Motivation

A global map of transcription factor binding sites (TFBSs) is critical to understanding gene regulation and genome function. DNaseI digestion of chromatin coupled with massively parallel sequencing (digital genomic footprinting) enables the identification of protein-binding footprints with high resolution on a genome-wide scale. However, accurately inferring the locations of these footprints remains a challenging computational problem.

Results

We present a dynamic Bayesian network-based approach for the identification and assignment of statistical confidence estimates to protein-binding footprints from digital genomic footprinting data. The method, DBFP, allows footprints to be identified in a probabilistic framework and outperforms our previously described algorithm in terms of precision at a fixed recall. Applied to a digital footprinting data set from Saccharomyces cerevisiae, DBFP identifies 4679 statistically significant footprints within intergenic regions. These footprints are mainly located near transcription start sites and are strongly enriched for known TFBSs. Footprints containing no known motif are preferentially located proximal to other footprints, consistent with cooperative binding of these footprints. DBFP also identifies a set of statistically significant footprints in the yeast coding regions. Many of these footprints coincide with the boundaries of antisense transcripts, and the most significant footprints are enriched for binding sites of the chromatin-associated factors Abf1 and Rap1.

Supplementary information

Supplementary material is available at Bioinformatics online.

SUBMITTER: Chen X 

PROVIDER: S-EPMC2881360 | biostudies-literature | 2010 Jun

REPOSITORIES: biostudies-literature

altmetric image

Publications

A dynamic Bayesian network for identifying protein-binding footprints from single molecule-based sequencing data.

Chen Xiaoyu X   Hoffman Michael M MM   Bilmes Jeff A JA   Hesselberth Jay R JR   Noble William S WS  

Bioinformatics (Oxford, England) 20100601 12


<h4>Motivation</h4>A global map of transcription factor binding sites (TFBSs) is critical to understanding gene regulation and genome function. DNaseI digestion of chromatin coupled with massively parallel sequencing (digital genomic footprinting) enables the identification of protein-binding footprints with high resolution on a genome-wide scale. However, accurately inferring the locations of these footprints remains a challenging computational problem.<h4>Results</h4>We present a dynamic Bayes  ...[more]

Similar Datasets

| S-EPMC6336789 | biostudies-literature
| S-EPMC4480965 | biostudies-literature
| S-EPMC9503243 | biostudies-literature
| S-EPMC5763373 | biostudies-other
| S-EPMC5737501 | biostudies-literature
| S-EPMC7487589 | biostudies-literature
| S-EPMC9017157 | biostudies-literature
| S-EPMC7410337 | biostudies-literature
| S-EPMC4467879 | biostudies-literature
| S-EPMC8168892 | biostudies-literature