Unknown

Dataset Information

0

Sequence-to-function deep learning frameworks for engineered riboregulators.


ABSTRACT: While synthetic biology has revolutionized our approaches to medicine, agriculture, and energy, the design of completely novel biological circuit components beyond naturally-derived templates remains challenging due to poorly understood design rules. Toehold switches, which are programmable nucleic acid sensors, face an analogous design bottleneck; our limited understanding of how sequence impacts functionality often necessitates expensive, time-consuming screens to identify effective switches. Here, we introduce Sequence-based Toehold Optimization and Redesign Model (STORM) and Nucleic-Acid Speech (NuSpeak), two orthogonal and synergistic deep learning architectures to characterize and optimize toeholds. Applying techniques from computer vision and natural language processing, we 'un-box' our models using convolutional filters, attention maps, and in silico mutagenesis. Through transfer-learning, we redesign sub-optimal toehold sensors, even with sparse training data, experimentally validating their improved performance. This work provides sequence-to-function deep learning frameworks for toehold selection and design, augmenting our ability to construct potent biological circuit components and precision diagnostics.

SUBMITTER: Valeri JA 

PROVIDER: S-EPMC7541510 | biostudies-literature | 2020 Oct

REPOSITORIES: biostudies-literature

altmetric image

Publications

Sequence-to-function deep learning frameworks for engineered riboregulators.

Valeri Jacqueline A JA   Collins Katherine M KM   Ramesh Pradeep P   Alcantar Miguel A MA   Lepe Bianca A BA   Lu Timothy K TK   Camacho Diogo M DM  

Nature communications 20201007 1


While synthetic biology has revolutionized our approaches to medicine, agriculture, and energy, the design of completely novel biological circuit components beyond naturally-derived templates remains challenging due to poorly understood design rules. Toehold switches, which are programmable nucleic acid sensors, face an analogous design bottleneck; our limited understanding of how sequence impacts functionality often necessitates expensive, time-consuming screens to identify effective switches.  ...[more]

Similar Datasets

| S-EPMC9755234 | biostudies-literature
| S-EPMC8165448 | biostudies-literature
| S-EPMC8719239 | biostudies-literature
| S-EPMC9279499 | biostudies-literature
2024-02-03 | GSE254493 | GEO
| S-EPMC9247339 | biostudies-literature
| S-EPMC7363850 | biostudies-literature
| S-EPMC9549170 | biostudies-literature
| S-EPMC7377280 | biostudies-literature
| S-EPMC6081423 | biostudies-literature