Other

Dataset Information

0

Learning the sequence code for mRNA and protein abundance in human immune cells


ABSTRACT: mRNA and protein abundance are defined by transcriptional and post-transcriptional regulatory mechanisms. Here, we develop a machine learning pipeline, termed SONAR, to decipher the endogenous sequence code that determines mRNA and protein abundance in human cells. SONAR models predict up to 62% of mRNA and 63% of protein abundance independent of promoter or enhancer information, and reveal a strong—yet dynamic—cell-type specific sequence code. We also find that the effect of sequence features is dependent on their location within the mRNA transcript. Using SONAR, we design synthetic 3’UTRs, with which protein expression levels can be manipulated and tailored to a specific cell-type. Beyond its fundamental findings, our work provides novel means to improve immunotherapies and biotechnology applications.

ORGANISM(S): Homo sapiens

PROVIDER: GSE240919 | GEO | 2023/09/20

REPOSITORIES: GEO

Dataset's files

Source:
Action DRS
Other
Items per page:
1 - 1 of 1

Similar Datasets

| PRJNA1005690 | ENA
2024-07-29 | GSE267857 | GEO
2016-10-06 | GSE86035 | GEO
2024-10-22 | GSE280041 | GEO
| PRJNA632919 | ENA
2021-05-06 | GSE150617 | GEO
2022-01-28 | GSE194394 | GEO
2022-01-28 | GSE194393 | GEO
2022-01-28 | GSE194392 | GEO
2022-01-28 | GSE194391 | GEO