
Dataset Information


Design of metalloproteins and novel protein folds using variational autoencoders.


ABSTRACT: The design of novel proteins has many applications but remains an attritional process with success in isolated cases. Meanwhile, deep learning technologies have exploded in popularity in recent years and are increasingly applicable to biology due to the rise in available data. We attempt to link protein design and deep learning by using variational autoencoders to generate protein sequences conditioned on desired properties. Potential copper and calcium binding sites are added to non-metal binding proteins without human intervention and compared to a hidden Markov model. In another use case, a grammar of protein structures is developed and used to produce sequences for a novel protein topology. One candidate structure is found to be stable by molecular dynamics simulation. The ability of our model to confine the vast search space of protein sequences and to scale easily has the potential to assist in a variety of protein design tasks.

SUBMITTER: Greener JG 

PROVIDER: S-EPMC6212568 | biostudies-literature | 2018 Nov

REPOSITORIES: biostudies-literature


Publications

Design of metalloproteins and novel protein folds using variational autoencoders.

Greener JG, Moffat L, Jones DT

Scientific Reports, 2018-11-01, 1



Similar Datasets

| S-EPMC8605902 | biostudies-literature
| S-EPMC7067240 | biostudies-literature
| S-EPMC7946179 | biostudies-literature
| S-EPMC9813669 | biostudies-literature
| S-EPMC6917668 | biostudies-literature
| S-EPMC7973750 | biostudies-literature
| S-EPMC9374267 | biostudies-literature
| S-EPMC8489729 | biostudies-literature
| S-EPMC7775287 | biostudies-literature
| S-EPMC10590447 | biostudies-literature