Dataset Information

TF ChIP-seq Drosophila melanogaster

ABSTRACT: A catalog of transcription factor (TF) binding sites in the genome is critical for deciphering regulatory relationships. Here we present the culmination of the efforts of the modENCODE (Model Organism ENCyclopedia Of DNA Elements) and modERN (model organism Encyclopedia of Regulatory Networks) consortia to systematically assay TF binding events in vivo in two major model organisms, Drosophila melanogaster (fly) and Caenorhabditis elegans (worm). These datasets comprise 605 TFs identifying 3.6M sites in the fly and 356 TFs identifying 0.9M sites in the worm, and represent the majority of the regulatory space in each genome. We demonstrate that TFs associate with chromatin in clusters termed “metapeaks”, that larger metapeaks have characteristics of high occupancy target (HOT) regions, and that the importance of consensus sequence motifs bound by TFs depends on metapeak size and complexity. Combining ChIP-seq data with single-cell RNA-seq data in a machine learning model identifies particular TFs with a prominent role in promoting target gene expression in specific cell types, even differentiating between parent and daughter cells during embryogenesis. These data are a rich resource for the community that should fuel and guide future investigations into TF function. To facilitate data accessibility and utility, all strains expressing GFP-tagged TFs are available at the stock centers for each organism. The chromatin immunoprecipitation sequencing data are available through the ENCODE Data Coordinating Center, GEO, and a direct interface (http://epic.gs.washington.edu/modERN/) that provides rapid access to processed data sets and summary analyses, as well as widgets to probe the cell type-specific TF-target relationships.
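
The abstract describes TF binding sites grouping into clusters ("metapeaks") whose size and complexity matter. The record does not say how metapeaks were actually called, so the following is only a minimal sketch of one plausible approach: merging overlapping or touching peak intervals from several TFs and counting the distinct TFs in each cluster. The function name `cluster_peaks`, the `max_gap` parameter, the tuple layout, and the example coordinates are all hypothetical and are not taken from the dataset or its pipeline.

```python
from collections import defaultdict


def cluster_peaks(peaks, max_gap=0):
    """Group peaks into clusters of overlapping or touching intervals.

    peaks: iterable of (chrom, start, end, tf) tuples (hypothetical layout).
    max_gap: peaks separated by at most this many bases are also merged
             (an assumed tuning knob, not a documented parameter).
    Returns a list of dicts, one per cluster, ordered by chrom and start.
    """
    by_chrom = defaultdict(list)
    for chrom, start, end, tf in peaks:
        by_chrom[chrom].append((start, end, tf))

    clusters = []
    for chrom in sorted(by_chrom):
        current = None
        # Sorting by start lets a single left-to-right sweep find clusters.
        for start, end, tf in sorted(by_chrom[chrom]):
            if current is not None and start <= current["end"] + max_gap:
                # Peak overlaps or touches the open cluster: extend it.
                current["end"] = max(current["end"], end)
                current["tfs"].add(tf)
                current["n_peaks"] += 1
            else:
                if current is not None:
                    clusters.append(current)
                current = {"chrom": chrom, "start": start, "end": end,
                           "tfs": {tf}, "n_peaks": 1}
        if current is not None:
            clusters.append(current)
    return clusters


# Made-up example coordinates, for illustration only.
peaks = [
    ("chr2L", 100, 200, "TF_A"),
    ("chr2L", 150, 260, "TF_B"),
    ("chr2L", 500, 600, "TF_A"),
    ("chr3R", 10, 50, "TF_C"),
]
clusters = cluster_peaks(peaks)
for c in clusters:
    print(c["chrom"], c["start"], c["end"], len(c["tfs"]), c["n_peaks"])
```

In this sketch, the number of distinct TFs per cluster (`len(c["tfs"])`) is one simple stand-in for the cluster "size" the abstract refers to; the published analysis may define it differently.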

ORGANISM(S): Drosophila melanogaster

PROVIDER: GSE278063 | GEO | 2024/10/06

REPOSITORIES: GEO

Similar Datasets

Project description: Understanding the regulatory genome remains a significant challenge. Annotation of regulatory elements and identification of the transcription factors (TFs) targeting these elements are key steps in understanding how a given cell interprets its genetic blueprint. One goal of the modENCODE (model organism Encyclopedia of DNA Elements) project is to survey a diverse sampling of TFs, both DNA-binding and non-DNA-binding factors, to provide a framework for the subsequent study of the mechanisms by which transcriptional regulators target the genome. Here we provide an updated map of the Drosophila melanogaster regulatory genome based on the location of 84 TFs at various stages of development. This regulatory map reveals a variety of genomic targeting patterns, including factors with strong preferences toward proximal promoter binding, factors that target intergenic and intronic DNA, and factors with distinct chromatin state preferences. The data also suggest the existence of a partially self-contained Polycomb regulatory network, and highlight the importance of Trithorax-like (Trl) in maintaining hotspots of DNA binding throughout development. Furthermore, the data identify over 5,800 instances in which TFs target DNA regions with demonstrated enhancer activity. Regions of high TF co-occupancy are more likely to be associated with open enhancers used across cell types, while lower TF occupancy regions are associated with complex enhancers that are also regulated at the epigenetic level. A putative regulatory network generated based on these 84 regulators reveals hundreds of co-binding events, thousands of potential regulatory interactions, and distinct regulatory strategies at developmental and housekeeping genes. These data serve as a resource for the research community in the continued effort to dissect transcriptional regulatory mechanisms directing Drosophila development.
For data usage terms and conditions, please refer to http://www.genome.gov/27528022 and http://www.genome.gov/Pages/Research/ENCODE/ENCODEDataReleasePolicyFinal2008.pdf
This is a dataset generated by the Drosophila Regulatory Elements modENCODE Project led by Kevin P. White at the University of Chicago.