Unknown

Dataset Information

0

Accelerating molecular simulations of proteins using Bayesian inference on weak information.


ABSTRACT: Atomistic molecular dynamics (MD) simulations of protein molecules are too computationally expensive to predict most native structures from amino acid sequences. Here, we integrate "weak" external knowledge into folding simulations to predict protein structures, given their sequence. For example, we instruct the computer "to form a hydrophobic core," "to form good secondary structures," or "to seek a compact state." This kind of information has been too combinatoric, nonspecific, and vague to help guide MD simulations before. Within atomistic replica-exchange molecular dynamics (REMD), we develop a statistical mechanical framework, modeling using limited data with coarse physical insight(s) (MELD + CPI), for harnessing weak information. As a test, we apply MELD + CPI to predict the native structures of 20 small proteins. MELD + CPI samples to within less than 3.2 Å from native for all 20 and correctly chooses the native structures (<4 Å) for 15 of them, including ubiquitin, a millisecond folder. MELD + CPI is up to five orders of magnitude faster than brute-force MD, satisfies detailed balance, and should scale well to larger proteins. MELD + CPI may be useful where physics-based simulations are needed to study protein mechanisms and populations and where we have some heuristic or coarse physical knowledge about states of interest.

SUBMITTER: Perez A 

PROVIDER: S-EPMC4586851 | biostudies-literature | 2015 Sep

REPOSITORIES: biostudies-literature

altmetric image

Publications

Accelerating molecular simulations of proteins using Bayesian inference on weak information.

Perez Alberto A   MacCallum Justin L JL   Dill Ken A KA  

Proceedings of the National Academy of Sciences of the United States of America 20150908 38


Atomistic molecular dynamics (MD) simulations of protein molecules are too computationally expensive to predict most native structures from amino acid sequences. Here, we integrate "weak" external knowledge into folding simulations to predict protein structures, given their sequence. For example, we instruct the computer "to form a hydrophobic core," "to form good secondary structures," or "to seek a compact state." This kind of information has been too combinatoric, nonspecific, and vague to he  ...[more]

Similar Datasets

| S-EPMC5408833 | biostudies-other
| S-EPMC10355842 | biostudies-literature
| S-EPMC10081221 | biostudies-literature
| S-EPMC10491301 | biostudies-literature
| S-EPMC7512506 | biostudies-literature
| S-EPMC3098374 | biostudies-literature
| S-EPMC5612641 | biostudies-literature
| S-EPMC8964751 | biostudies-literature
| S-EPMC3274721 | biostudies-literature
| S-EPMC4796016 | biostudies-literature