Unknown

Dataset Information

0

Human 5' UTR design and variant effect prediction from a massively parallel translation assay.


ABSTRACT: The ability to predict the impact of cis-regulatory sequences on gene expression would facilitate discovery in fundamental and applied biology. Here we combine polysome profiling of a library of 280,000 randomized 5' untranslated regions (UTRs) with deep learning to build a predictive model that relates human 5' UTR sequence to translation. Together with a genetic algorithm, we use the model to engineer new 5' UTRs that accurately direct specified levels of ribosome loading, providing the ability to tune sequences for optimal protein expression. We show that the same approach can be extended to chemically modified RNA, an important feature for applications in mRNA therapeutics and synthetic biology. We test 35,212 truncated human 5' UTRs and 3,577 naturally occurring variants and show that the model predicts ribosome loading of these sequences. Finally, we provide evidence of 45 single-nucleotide variants (SNVs) associated with human diseases that substantially change ribosome loading and thus may represent a molecular basis for disease.

SUBMITTER: Sample PJ 

PROVIDER: S-EPMC7100133 | biostudies-literature | 2019 Jul

REPOSITORIES: biostudies-literature

altmetric image

Publications

Human 5' UTR design and variant effect prediction from a massively parallel translation assay.

Sample Paul J PJ   Wang Ban B   Reid David W DW   Presnyak Vlad V   McFadyen Iain J IJ   Morris David R DR   Seelig Georg G  

Nature biotechnology 20190701 7


The ability to predict the impact of cis-regulatory sequences on gene expression would facilitate discovery in fundamental and applied biology. Here we combine polysome profiling of a library of 280,000 randomized 5' untranslated regions (UTRs) with deep learning to build a predictive model that relates human 5' UTR sequence to translation. Together with a genetic algorithm, we use the model to engineer new 5' UTRs that accurately direct specified levels of ribosome loading, providing the abilit  ...[more]

Similar Datasets

2018-05-03 | GSE114002 | GEO
| PRJNA454863 | ENA
| S-EPMC5994907 | biostudies-literature
| S-EPMC6581050 | biostudies-literature
| S-EPMC5198521 | biostudies-other
2017-12-12 | GSE106677 | GEO
| S-EPMC3151293 | biostudies-literature
| PRJNA306869 | ENA
| S-EPMC9169645 | biostudies-literature
2019-11-26 | GSE140983 | GEO