Unknown

Dataset Information

0

Efficient sampling in fragment-based protein structure prediction using an estimation of distribution algorithm.


ABSTRACT: Fragment assembly is a powerful method of protein structure prediction that builds protein models from a pool of candidate fragments taken from known structures. Stochastic sampling is subsequently used to refine the models. The structures are first represented as coarse-grained models and then as all-atom models for computational efficiency. Many models have to be generated independently due to the stochastic nature of the sampling methods used to search for the global minimum in a complex energy landscape. In this paper we present EdaFold(AA), a fragment-based approach which shares information between the generated models and steers the search towards native-like regions. A distribution over fragments is estimated from a pool of low energy all-atom models. This iteratively-refined distribution is used to guide the selection of fragments during the building of models for subsequent rounds of structure prediction. The use of an estimation of distribution algorithm enabled EdaFold(AA) to reach lower energy levels and to generate a higher percentage of near-native models. [Formula: see text] uses an all-atom energy function and produces models with atomic resolution. We observed an improvement in energy-driven blind selection of models on a benchmark of EdaFold(AA) in comparison with the [Formula: see text] AbInitioRelax protocol.

SUBMITTER: Simoncini D 

PROVIDER: S-EPMC3723781 | biostudies-literature | 2013

REPOSITORIES: biostudies-literature

altmetric image

Publications

Efficient sampling in fragment-based protein structure prediction using an estimation of distribution algorithm.

Simoncini David D   Zhang Kam Y J KY  

PloS one 20130725 7


Fragment assembly is a powerful method of protein structure prediction that builds protein models from a pool of candidate fragments taken from known structures. Stochastic sampling is subsequently used to refine the models. The structures are first represented as coarse-grained models and then as all-atom models for computational efficiency. Many models have to be generated independently due to the stochastic nature of the sampling methods used to search for the global minimum in a complex ener  ...[more]

Similar Datasets

| S-EPMC297010 | biostudies-literature
| S-EPMC3240822 | biostudies-literature
| S-EPMC4781684 | biostudies-literature
| S-EPMC5041553 | biostudies-literature
| S-EPMC6030820 | biostudies-other
| S-EPMC1304563 | biostudies-literature
2022-05-16 | GSE189510 | GEO
| S-EPMC3030773 | biostudies-literature
| S-EPMC2760740 | biostudies-literature