
Dataset Information


Machine learning-aided design and screening of an emergent protein function in synthetic cells.


ABSTRACT: Recently, utilization of Machine Learning (ML) has led to astonishing progress in computational protein design, bringing into reach the targeted engineering of proteins for industrial and biomedical applications. However, the design of proteins for emergent functions of core relevance to cells, such as the ability to spatiotemporally self-organize and thereby structure the cellular space, is still extremely challenging. While on the generative side conditional generative models and multi-state design are on the rise, for emergent functions there is a lack of tailored screening methods as typically needed in a protein design project, both computational and experimental. Here we describe a proof-of-principle of how such screening, in silico and in vitro, can be achieved for ML-generated variants of a protein that forms intracellular spatiotemporal patterns. For computational screening we use a structure-based divide-and-conquer approach to find the most promising candidates, while for the subsequent in vitro screening we use synthetic cell-mimics as established by Bottom-Up Synthetic Biology. We then show that the best screened candidate can indeed completely substitute the wildtype gene in Escherichia coli. These results raise great hopes for the next level of synthetic biology, where ML-designed synthetic proteins will be used to engineer cellular functions.

SUBMITTER: Kohyama S 

PROVIDER: S-EPMC10914801 | biostudies-literature | 2024 Mar

REPOSITORIES: biostudies-literature


Publications

Machine learning-aided design and screening of an emergent protein function in synthetic cells.

Kohyama S, Frohn BP, Babl L, Schwille P

Nature Communications, 2024-03-05, issue 1



Similar Datasets

| S-EPMC8314522 | biostudies-literature
| S-EPMC7739401 | biostudies-literature
| S-EPMC8941095 | biostudies-literature
2021-06-01 | GSE171549 | GEO
2021-07-26 | GSE175955 | GEO
| S-EPMC11755723 | biostudies-literature
| S-EPMC9843531 | biostudies-literature
| S-EPMC10403280 | biostudies-literature
| S-EPMC9136609 | biostudies-literature
2023-05-16 | GSE232161 | GEO