Transcriptomics

Dataset Information

0

Systematic Interrogation of Human Promoters


ABSTRACT: Despite much research, our understanding of the architecture and cis-regulatory elements of human promoters is still lacking. Here, we devised a high-throughput assay to quantify the activity of ~15,000 fully designed sequences that we integrated and expressed from a fixed location within the human genome. We used this method to investigate thousands of native promoters and pre-initiation complex (PIC) binding regions followed by in-depth characterization of the sequence motifs underlying promoter activity including core promoter elements and TF binding-sites. We find that core promoters drive transcription mostly unidirectionally, and that sequences originating from promoters exhibit stronger activity than those originating from enhancers. Testing multiple synthetic configurations of core promoter elements, we dissect the motifs that positively and negatively regulate transcription as well as the effect of their combinations and distances, including a 10bp periodicity in the optimal distance between the TATA and the Initiator. By comprehensively screening 133 TF binding-sites, we find that in contrast to core promoters, TF binding-sites maintain similar activity levels in both orientations supporting a model by which divergent transcription is driven by two distinct unidirectional core promoters sharing bidirectional TF binding-sites. Finally, we find a striking agreement between the effect of binding site multiplicity of individual TFs in our assay and their tendency to appear in homotypic clusters throughout the genome. Overall, our study systematically assays the elements that drive expression in core- and proximal- promoter regions and shed light on organization principles of regulatory regions in the human genome.

ORGANISM(S): synthetic construct

PROVIDER: GSE118242 | GEO | 2018/08/08

REPOSITORIES: GEO

Dataset's files

Source:
Action DRS
Other
Items per page:
1 - 1 of 1

Similar Datasets

2005-01-01 | MODEL1202270000 | BioModels
2005-01-01 | MODEL1202270001 | BioModels
2016-05-23 | GSE77213 | GEO
2019-01-08 | GSE117594 | GEO
2016-12-14 | GSE92306 | GEO
2013-02-01 | E-GEOD-42117 | biostudies-arrayexpress
2013-04-01 | E-GEOD-34975 | biostudies-arrayexpress
2014-11-10 | GSE60455 | GEO
2014-11-10 | GSE60454 | GEO
2014-11-10 | GSE60453 | GEO