Unknown

Dataset Information

0

Data driven flexible backbone protein design.


ABSTRACT: Protein design remains an important problem in computational structural biology. Current computational protein design methods largely use physics-based methods, which make use of information from a single protein structure. This is despite the fact that multiple structures of many protein folds are now readily available in the PDB. While ensemble protein design methods can use multiple protein structures, they treat each structure independently. Here, we introduce a flexible backbone strategy, FlexiBaL-GP, which learns global protein backbone movements directly from multiple protein structures. FlexiBaL-GP uses the machine learning method of Gaussian Process Latent Variable Models to learn a lower dimensional representation of the protein coordinates that best represent backbone movements. These learned backbone movements are used to explore alternative protein backbones, while engineering a protein within a parallel tempered MCMC framework. Using the human ubiquitin-USP21 complex as a model we demonstrate that our design strategy outperforms current strategies for the interface design task of identifying tight binding ubiquitin variants for USP21.

SUBMITTER: Sun MGF 

PROVIDER: S-EPMC5587332 | biostudies-literature | 2017 Aug

REPOSITORIES: biostudies-literature

altmetric image

Publications

Data driven flexible backbone protein design.

Sun Mark G F MGF   Kim Philip M PM  

PLoS computational biology 20170824 8


Protein design remains an important problem in computational structural biology. Current computational protein design methods largely use physics-based methods, which make use of information from a single protein structure. This is despite the fact that multiple structures of many protein folds are now readily available in the PDB. While ensemble protein design methods can use multiple protein structures, they treat each structure independently. Here, we introduce a flexible backbone strategy, F  ...[more]

Similar Datasets

| S-EPMC3166072 | biostudies-literature
| S-EPMC3750959 | biostudies-literature
| S-EPMC6901717 | biostudies-literature
| S-EPMC2896185 | biostudies-literature
| S-EPMC2774439 | biostudies-literature
| S-EPMC3138746 | biostudies-literature
| S-EPMC3372604 | biostudies-literature
| S-EPMC7614567 | biostudies-literature
| S-EPMC6184633 | biostudies-literature
| S-EPMC3426558 | biostudies-literature