Dataset Information

Sampling bottlenecks in de novo protein structure prediction.

ABSTRACT: The primary obstacle to de novo protein structure prediction is conformational sampling: the native state generally has lower free energy than nonnative structures but is exceedingly difficult to locate. Structure predictions with atomic level accuracy have been made for small proteins using the Rosetta structure prediction method, but for larger and more complex proteins, the native state is virtually never sampled, and it has been unclear how much of an increase in computing power would be required to successfully predict the structures of such proteins. In this paper, we develop an approach to determining how much computer power is required to accurately predict the structure of a protein, based on a reformulation of the conformational search problem as a combinatorial sampling problem in a discrete feature space. We find that conformational sampling for many proteins is limited by critical "linchpin" features, often the backbone torsion angles of individual residues, which are sampled very rarely in unbiased trajectories and, when constrained, dramatically increase the sampling of the native state. These critical features frequently occur in less regular and likely strained regions of proteins that contribute to protein function. In a number of proteins, the linchpin features are in regions found experimentally to form late in folding, suggesting a correspondence between folding in silico and in reality.

SUBMITTER: Kim DE

PROVIDER: S-EPMC2760740 | biostudies-literature | 2009 Oct

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

Sampling bottlenecks in de novo protein structure prediction.

Kim David E DE Blum Ben B Bradley Philip P Baker David D

Journal of molecular biology 20090728 1

The primary obstacle to de novo protein structure prediction is conformational sampling: the native state generally has lower free energy than nonnative structures but is exceedingly difficult to locate. Structure predictions with atomic level accuracy have been made for small proteins using the Rosetta structure prediction method, but for larger and more complex proteins, the native state is virtually never sampled, and it has been unclear how much of an increase in computing power would be req ...[more]

PMID: 19646450

Dataset Information

Sampling bottlenecks in de novo protein structure prediction.

Publications

Sampling bottlenecks in de novo protein structure prediction.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

UniCon3D: de novo protein structure prediction using united-residue conformational search via stepwise, probabilistic sampling.
| S-EPMC5018369 | biostudies-literature

Building a better fragment library for de novo protein structure prediction.
| S-EPMC4406757 | biostudies-literature

De novo protein structure prediction using ultra-fast molecular dynamics simulation.
| S-EPMC6245515 | biostudies-literature

LZerD Protein-Protein Docking Webserver Enhanced With <i>de novo</i> Structure Prediction.
| S-EPMC8403062 | biostudies-literature

De novo protein design by inversion of the AlphaFold structure prediction network.
| S-EPMC10204179 | biostudies-literature

Generalized ensemble methods for de novo structure prediction.
| S-EPMC2631076 | biostudies-literature

De novo protein conformational sampling using a probabilistic graphical model.
| S-EPMC4635387 | biostudies-literature

Sequential search leads to faster, more efficient fragment-based de novo protein structure prediction.
| S-EPMC6030820 | biostudies-literature

De novo prediction of protein folding pathways and structure using the principle of sequential stabilization.
| S-EPMC3491489 | biostudies-literature

Bihelix: Towards de novo structure prediction of an ensemble of G-protein coupled receptor conformations.
| S-EPMC3310341 | biostudies-literature