
Dataset Information


A map of direct TF-DNA interactions in the human genome.


ABSTRACT: Chromatin immunoprecipitation followed by sequencing (ChIP-seq) is the most popular assay to identify genomic regions, called ChIP-seq peaks, that are bound in vivo by transcription factors (TFs). These regions are derived from direct TF-DNA interactions, indirect binding of the TF to the DNA (through a co-binding partner), nonspecific binding to the DNA, and noise/bias/artifacts. Delineating the bona fide direct TF-DNA interactions within the ChIP-seq peaks remains challenging. We developed dedicated software, ChIP-eat, that combines computational TF binding models and ChIP-seq peaks to automatically predict direct TF-DNA interactions. Our work culminated in predicted interactions covering >4% of the human genome, obtained by uniformly processing 1983 ChIP-seq peak data sets from the ReMap database for 232 unique TFs. The predictions were assessed a posteriori using protein binding microarray and ChIP-exo data, and were predominantly found in high-quality ChIP-seq peaks. The set of predicted direct TF-DNA interactions suggested that high-occupancy target regions are likely not derived from direct binding of the TFs to the DNA. From our predictions, we derived co-binding TFs supported by protein-protein interaction data and defined cis-regulatory modules enriched for disease- and trait-associated SNPs. We provide this collection of direct TF-DNA interactions and cis-regulatory modules through the UniBind web interface (http://unibind.uio.no).

SUBMITTER: Gheorghe M 

PROVIDER: S-EPMC6393237 | biostudies-literature | 2019 Feb

REPOSITORIES: biostudies-literature


Publications

A map of direct TF-DNA interactions in the human genome.

Gheorghe M, Sandve GK, Khan A, Chèneby J, Ballester B, Mathelier A

Nucleic Acids Research, 2019-02-01, issue 4



Similar Datasets

| S-EPMC8236138 | biostudies-literature
| S-EPMC5046229 | biostudies-literature
| S-EPMC4248309 | biostudies-literature
| S-EPMC10942648 | biostudies-literature
| S-EPMC10245841 | biostudies-literature
| S-EPMC1878506 | biostudies-literature
2014-08-21 | E-GEOD-59395 | biostudies-arrayexpress
2014-08-21 | GSE59395 | GEO
| S-EPMC2951675 | biostudies-literature
| S-EPMC3024863 | biostudies-other