Unknown

Dataset Information

0

A cis-regulatory lexicon of DNA motif combinations mediating cell-type-specific gene regulation.


ABSTRACT: Gene expression is controlled by transcription factors (TFs) that bind cognate DNA motif sequences in cis-regulatory elements (CREs). The combinations of DNA motifs acting within homeostasis and disease, however, are unclear. Gene expression, chromatin accessibility, TF footprinting, and H3K27ac-dependent DNA looping data were generated and a random-forest-based model was applied to identify 7,531 cell-type-specific cis-regulatory modules (CRMs) across 15 diploid human cell types. A co-enrichment framework within CRMs nominated 838 cell-type-specific, recurrent heterotypic DNA motif combinations (DMCs), which were functionally validated using massively parallel reporter assays. Cancer cells engaged DMCs linked to neoplasia-enabling processes operative in normal cells while also activating new DMCs only seen in the neoplastic state. This integrative approach identifies cell-type-specific cis-regulatory combinatorial DNA motifs in diverse normal and diseased human cells and represents a general framework for deciphering cis-regulatory sequence logic in gene regulation.

SUBMITTER: Donohue LKH 

PROVIDER: S-EPMC9894309 | biostudies-literature | 2022 Nov

REPOSITORIES: biostudies-literature

altmetric image

Publications

A <i>cis</i>-regulatory lexicon of DNA motif combinations mediating cell-type-specific gene regulation.

Donohue Laura K H LKH   Guo Margaret G MG   Zhao Yang Y   Jung Namyoung N   Bussat Rose T RT   Kim Daniel S DS   Neela Poornima H PH   Kellman Laura N LN   Garcia Omar S OS   Meyers Robin M RM   Altman Russ B RB   Khavari Paul A PA  

Cell genomics 20221005 11


Gene expression is controlled by transcription factors (TFs) that bind cognate DNA motif sequences in <i>cis</i>-regulatory elements (CREs). The combinations of DNA motifs acting within homeostasis and disease, however, are unclear. Gene expression, chromatin accessibility, TF footprinting, and H3K27ac-dependent DNA looping data were generated and a random-forest-based model was applied to identify 7,531 cell-type-specific <i>cis</i>-regulatory modules (CRMs) across 15 diploid human cell types.  ...[more]

Similar Datasets

2022-08-17 | GSE188405 | GEO
2022-08-17 | GSE188403 | GEO
2022-08-17 | GSE188401 | GEO
2022-08-17 | GSE188398 | GEO
2022-08-17 | GSE186947 | GEO
| PRJNA778734 | ENA
| PRJNA778736 | ENA
| PRJNA777829 | ENA
| PRJNA778728 | ENA
| PRJNA778732 | ENA