Dataset Information

Incorporation of evolutionary information into Rosetta comparative modeling.

ABSTRACT: Prediction of protein structures from sequences is a fundamental problem in computational biology. Algorithms that attempt to predict a structure from sequence primarily use two sources of information. The first source is physical in nature: proteins fold into their lowest energy state. Given an energy function that describes the interactions governing folding, a method for constructing models of protein structures, and the amino acid sequence of a protein of interest, the structure prediction problem becomes a search for the lowest energy structure. Evolution provides an orthogonal source of information: proteins of similar sequences have similar structure, and therefore proteins of known structure can guide modeling. The relatively successful Rosetta approach takes advantage of the first, but not the second source of information during model optimization. Following the classic work by Andrej Sali and colleagues, we develop a probabilistic approach to derive spatial restraints from proteins of known structure using advances in alignment technology and the growth in the number of structures in the Protein Data Bank. These restraints define a region of conformational space that is high-probability, given the template information, and we incorporate them into Rosetta's comparative modeling protocol. The combined approach performs considerably better on a benchmark based on previous CASP experiments. Incorporating evolutionary information into Rosetta is analogous to incorporating sparse experimental data: in both cases, the additional information eliminates large regions of conformational space and increases the probability that energy-based refinement will hone in on the deep energy minimum at the native state.

SUBMITTER: Thompson J

PROVIDER: S-EPMC3538865 | biostudies-literature | 2011 Aug

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

Incorporation of evolutionary information into Rosetta comparative modeling.

Thompson James J Baker David D

Proteins 20110602 8

Prediction of protein structures from sequences is a fundamental problem in computational biology. Algorithms that attempt to predict a structure from sequence primarily use two sources of information. The first source is physical in nature: proteins fold into their lowest energy state. Given an energy function that describes the interactions governing folding, a method for constructing models of protein structures, and the amino acid sequence of a protein of interest, the structure prediction p ...[more]

PMID: 21638331

Dataset Information

Incorporation of evolutionary information into Rosetta comparative modeling.

Publications

Incorporation of evolutionary information into Rosetta comparative modeling.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

Rosetta design with co-evolutionary information retains protein function.
| S-EPMC7815116 | biostudies-literature

Comparative modeling and docking of chemokine-receptor interactions with Rosetta.
| S-EPMC7295663 | biostudies-literature

Using Rosetta for RNA homology modeling.
| S-EPMC7932369 | biostudies-literature

Modeling membrane geometries implicitly in Rosetta.
| S-EPMC10868433 | biostudies-literature

Modeling disordered regions in proteins using Rosetta.
| S-EPMC3146542 | biostudies-literature

Cross-link guided molecular modeling with ROSETTA.
| S-EPMC3775805 | biostudies-literature

Web-accessible molecular modeling with Rosetta: The Rosetta Online Server that Includes Everyone (ROSIE).
| S-EPMC5734271 | biostudies-literature

Modeling and docking of antibody structures with Rosetta.
| S-EPMC5739521 | biostudies-literature

PRosettaC: Rosetta Based Modeling of PROTAC Mediated Ternary Complexes.
| S-EPMC7592117 | biostudies-literature

Structural modeling of hERG channel-drug interactions using Rosetta.
| S-EPMC10682396 | biostudies-literature