Unknown

Dataset Information

0

Folding free energy function selects native-like protein sequences in the core but not on the surface.


ABSTRACT: An automatic protein design procedure is used to select amino acid sequences that optimize the folding free energy function for a given protein. The only information used in designing the sequences is a set of known backbone structures for each protein, a rotamer library, and a well established classical empirical force field, which relies on basic physical chemical principles that underlie molecular interactions and protein stability, and has not been adjusted to yield native-like sequences. Applying the procedure to 7 different known protein folds, representing a total of 45 different native protein structures, yields ensembles of designed sequences displaying remarkable similarity to their natural counterparts in the protein core, but which are distinctly non-native on the protein surface. We show that natural and designed sequences for a given fold score significantly higher than random sequences against profiles derived from both, designed and natural sequence ensembles. Furthermore, we find that designed sequence profiles can be used to retrieve the native sequences for many of the analyzed proteins using standard PSI-BLAST searches in sequence databases. These findings may have important implications for our understanding the selection pressures operating on natural protein sequences and hold promise for improving fold recognition.

SUBMITTER: Jaramillo A 

PROVIDER: S-EPMC129712 | biostudies-literature | 2002 Oct

REPOSITORIES: biostudies-literature

altmetric image

Publications

Folding free energy function selects native-like protein sequences in the core but not on the surface.

Jaramillo Alfonso A   Wernisch Lorenz L   Héry Stéphanie S   Wodak Shoshana J SJ  

Proceedings of the National Academy of Sciences of the United States of America 20021004 21


An automatic protein design procedure is used to select amino acid sequences that optimize the folding free energy function for a given protein. The only information used in designing the sequences is a set of known backbone structures for each protein, a rotamer library, and a well established classical empirical force field, which relies on basic physical chemical principles that underlie molecular interactions and protein stability, and has not been adjusted to yield native-like sequences. Ap  ...[more]

Similar Datasets

| S-EPMC18029 | biostudies-literature
| S-EPMC2700257 | biostudies-literature
| S-EPMC1865387 | biostudies-literature
| S-EPMC522040 | biostudies-literature
| S-EPMC4968343 | biostudies-literature
| S-EPMC7889939 | biostudies-literature
| S-EPMC2527276 | biostudies-other
| S-EPMC2671193 | biostudies-literature
| S-EPMC5153537 | biostudies-literature