Unknown,Transcriptomics,Genomics,Proteomics

Dataset Information

0

Dissection of thousands of cell type-specific enhancers identifies dinucleotide repeat motifs as general enhancer features


ABSTRACT: Gene expression is determined by genomic elements called enhancers, which contain short motifs bound by different transcription factors (TFs). However, how enhancer sequences and TF motifs relate to enhancer activity is unknown and general sequence requirements for enhancers or comprehensive sets of important enhancer sequence elements have remained elusive. Here, we computationally dissect thousands of functional enhancer sequences from three different Drosophila cell lines. We find that the enhancers display distinct cis-regulatory sequence signatures, which are predictive of the enhancersM-bM-^@M-^Y cell type-specific or broad activities. These signatures contain transcription factor motifs and a novel class of enhancer sequence elements, dinucleotide repeat motifs (DRMs). DRMs are highly enriched in enhancers, particularly in enhancers that are broadly active across different cell types. We experimentally validate the importance of the identified TF motifs and DRMs for enhancer function and show that they can be sufficient to create an active enhancer de novo from non-functional sequence. The function of DRMs as a novel class of general enhancer features that are also enriched in human regulatory regions might explain their implication in several diseases and provides important insights into gene regulation. STARR-seq was performed in BG3 cells with paired-end sequencing in two replicates and respective inputs.

ORGANISM(S): Drosophila melanogaster

SUBMITTER: J. Omar Yanez-Cuna 

PROVIDER: E-GEOD-49809 | biostudies-arrayexpress |

REPOSITORIES: biostudies-arrayexpress

Similar Datasets

2014-04-28 | E-GEOD-48251 | biostudies-arrayexpress
2014-03-27 | E-GEOD-47691 | biostudies-arrayexpress
2013-01-17 | E-GEOD-40739 | biostudies-arrayexpress
2014-12-10 | E-GEOD-57876 | biostudies-arrayexpress
2015-09-08 | E-GEOD-63782 | biostudies-arrayexpress
2021-03-09 | E-MTAB-9614 | biostudies-arrayexpress
2014-04-02 | GSE49809 | GEO
2019-11-30 | E-MTAB-7846 | biostudies-arrayexpress
2022-12-15 | GSE211657 | GEO
2022-12-15 | GSE211654 | GEO