Unknown,Transcriptomics,Genomics,Proteomics

Dataset Information

0

DNase I hypersensitivity and algorithmic prediction of TF binding in early pancreatic mES directed differentiation


ABSTRACT: This dataset uses DNase-seq to profile the genome-wide DNase I hypersensitivity of mES and mES-derived cells along an early pancreatic lineage and provides the locations of putative Transcription Factor (TF) binding sites using the PIQ algorithm. DNase-seq takes advantage of the preferential cutting of DNase I in open chromatin and steric blockage of of DNase I by tightly bound TFs that protect associated genomic DNA sequences. After deep sequencing of DNase IM-bM-^@M-^Sdigested genomic DNA from intact nuclei, genome-wide data on chromatin accessibility as well as TF-specific DNase I protection profiles that reveal the genomic binding locations of a majority of TFs are obtained. Such TF signature M-bM-^@M-^XDNase profilesM-bM-^@M-^Y reflect the effect of the TF on DNA shape and local chromatin architecture, extending hundreds of base pairs from a TF binding site, and these profiles are centered on M-bM-^@M-^XDNase footprintsM-bM-^@M-^Y at the binding motif itself, which reflects the biophysics of protein-DNA binding. An algorithm, PIQ, is then applied that models the specific profile of each factor, and in combination with sequence information predicts the likely binding locations of over 700 TFs genome wide. This dataset includes DNase-seq hypersensitivity data from 6 mES-derived cell types: mESC, Mesendoderm, Mesoderm, Endoderm, Intestinal Endoderm, and Prepancreatic Endoderm. For each cell type, TF binding site predictions are made based on the identification TF-specific DNase-seq profiles over any of 1331 possible binding motifs. After significance thesholding, genome-wide binding site predictions for <700 TFs are included.

ORGANISM(S): Mus musculus

SUBMITTER: David Gifford 

PROVIDER: E-GEOD-53776 | biostudies-arrayexpress |

REPOSITORIES: biostudies-arrayexpress

altmetric image

Publications

Discovery of directional and nondirectional pioneer transcription factors by modeling DNase profile magnitude and shape.

Sherwood Richard I RI   Hashimoto Tatsunori T   O'Donnell Charles W CW   Lewis Sophia S   Barkal Amira A AA   van Hoff John Peter JP   Karun Vivek V   Jaakkola Tommi T   Gifford David K DK  

Nature biotechnology 20140119 2


We describe protein interaction quantitation (PIQ), a computational method for modeling the magnitude and shape of genome-wide DNase I hypersensitivity profiles to identify transcription factor (TF) binding sites. Through the use of machine-learning techniques, PIQ identified binding sites for >700 TFs from one DNase I hypersensitivity analysis followed by sequencing (DNase-seq) experiment with accuracy comparable to that of chromatin immunoprecipitation followed by sequencing (ChIP-seq). We app  ...[more]

Similar Datasets

2011-11-09 | E-GEOD-26610 | biostudies-arrayexpress
2013-12-15 | E-GEOD-53322 | biostudies-arrayexpress
2012-08-23 | E-GEOD-40255 | biostudies-arrayexpress
2014-11-05 | E-GEOD-54139 | biostudies-arrayexpress
2013-12-14 | E-GEOD-45672 | biostudies-arrayexpress
2023-01-11 | E-MTAB-12512 | biostudies-arrayexpress
2015-06-25 | E-GEOD-68864 | biostudies-arrayexpress
2012-08-01 | E-GEOD-36979 | biostudies-arrayexpress
2014-05-08 | E-GEOD-50762 | biostudies-arrayexpress
2012-02-17 | E-GEOD-35875 | biostudies-arrayexpress