Unknown

Dataset Information

0

Model-driven generation of artificial yeast promoters.


ABSTRACT: Promoters play a central role in controlling gene regulation; however, a small set of promoters is used for most genetic construct design in the yeast Saccharomyces cerevisiae. Generating and utilizing models that accurately predict protein expression from promoter sequences would enable rapid generation of useful promoters and facilitate synthetic biology efforts in this model organism. We measure the gene expression activity of over 675,000 sequences in a constitutive promoter library and over 327,000 sequences in an inducible promoter library. Training an ensemble of convolutional neural networks jointly on the two data sets enables very high (R2?>?0.79) predictive accuracies on multiple sequence-activity prediction tasks. We describe model-guided design strategies that yield large, sequence-diverse sets of promoters exhibiting activities higher than those represented in training data and similar to current best-in-class sequences. Our results show the value of model-guided design as an approach for generating useful DNA parts.

SUBMITTER: Kotopka BJ 

PROVIDER: S-EPMC7192914 | biostudies-literature | 2020 Apr

REPOSITORIES: biostudies-literature

altmetric image

Publications

Model-driven generation of artificial yeast promoters.

Kotopka Benjamin J BJ   Smolke Christina D CD  

Nature communications 20200430 1


Promoters play a central role in controlling gene regulation; however, a small set of promoters is used for most genetic construct design in the yeast Saccharomyces cerevisiae. Generating and utilizing models that accurately predict protein expression from promoter sequences would enable rapid generation of useful promoters and facilitate synthetic biology efforts in this model organism. We measure the gene expression activity of over 675,000 sequences in a constitutive promoter library and over  ...[more]

Similar Datasets

2019-12-01 | GSE135464 | GEO
| PRJNA558976 | ENA
| S-EPMC6200791 | biostudies-other
| S-EPMC5041464 | biostudies-literature
| S-EPMC2766638 | biostudies-literature
| S-EPMC10990938 | biostudies-literature
| S-EPMC6267961 | biostudies-literature
| S-EPMC149202 | biostudies-literature
| S-EPMC5457518 | biostudies-literature
| S-EPMC3673867 | biostudies-other