Unknown

Dataset Information

0

Massively parallel characterization of transcriptional regulatory elements in three diverse human cell types.


ABSTRACT: The human genome contains millions of candidate cis-regulatory elements (CREs) with cell-type-specific activities that shape both health and myriad disease states. However, we lack a functional understanding of the sequence features that control the activity and cell-type-specific features of these CREs. Here, we used lentivirus-based massively parallel reporter assays (lentiMPRAs) to test the regulatory activity of over 680,000 sequences, representing a nearly comprehensive set of all annotated CREs among three cell types (HepG2, K562, and WTC11), finding 41.7% to be functional. By testing sequences in both orientations, we find promoters to have significant strand orientation effects. We also observe that their 200 nucleotide cores function as non-cell-type-specific 'on switches' providing similar expression levels to their associated gene. In contrast, enhancers have weaker orientation effects, but increased tissue-specific characteristics. Utilizing our lentiMPRA data, we develop sequence-based models to predict CRE function with high accuracy and delineate regulatory motifs. Testing an additional lentiMPRA library encompassing 60,000 CREs in all three cell types, we further identified factors that determine cell-type specificity. Collectively, our work provides an exhaustive catalog of functional CREs in three widely used cell lines, and showcases how large-scale functional measurements can be used to dissect regulatory grammar.

SUBMITTER: Agarwal V 

PROVIDER: S-EPMC10028905 | biostudies-literature | 2023 Mar

REPOSITORIES: biostudies-literature

altmetric image

Publications

Massively parallel characterization of transcriptional regulatory elements in three diverse human cell types.

Agarwal Vikram V   Inoue Fumitaka F   Schubach Max M   Martin Beth K BK   Dash Pyaree Mohan PM   Zhang Zicong Z   Sohota Ajuni A   Noble William Stafford WS   Yardimci Galip Gürkan GG   Kircher Martin M   Shendure Jay J   Ahituv Nadav N  

bioRxiv : the preprint server for biology 20230306


The human genome contains millions of candidate <i>cis</i>-regulatory elements (CREs) with cell-type-specific activities that shape both health and myriad disease states. However, we lack a functional understanding of the sequence features that control the activity and cell-type-specific features of these CREs. Here, we used lentivirus-based massively parallel reporter assays (lentiMPRAs) to test the regulatory activity of over 680,000 sequences, representing a nearly comprehensive set of all an  ...[more]

Similar Datasets

| S-EPMC11903340 | biostudies-literature
| S-EPMC6850896 | biostudies-literature
| S-EPMC9949039 | biostudies-literature
| S-EPMC3454335 | biostudies-literature
| S-EPMC6771677 | biostudies-literature
| S-EPMC8298436 | biostudies-literature
2019-10-23 | GSE115046 | GEO
| S-EPMC3675476 | biostudies-literature
| S-EPMC5241818 | biostudies-literature
| S-EPMC11839127 | biostudies-literature