Unknown

Dataset Information

0

Thermodynamic propensities of amino acids in the native state ensemble: implications for fold recognition.


ABSTRACT: An amino acid sequence, in the context of the solvent environment, contains all of the thermodynamic information necessary to encode a three-dimensional protein structure. To investigate the relationship between an amino acid sequence and its corresponding protein fold, a database of thermodynamic stability information was assembled that spanned 2951 residues from 44 nonhomologous proteins. This information was obtained using the COREX algorithm, which computes an ensemble-based description of the native state of a protein. It was observed that amino acid types partitioned unequally into high, medium, and low thermodynamic stability environments. Furthermore, these distributions were reproducible and were significantly different than those expected from random partitioning. To assess the structural importance of the distributions, simple fold-recognition experiments were performed based on a 3D-1D scoring matrix containing only COREX residue stability information. This procedure was able to recover amino acid sequences corresponding to correct target structures more effectively than scoring matrices derived from randomized data. High-scoring sequences were often aligned correctly with their corresponding target profiles, suggesting that calculated thermodynamic stability profiles have the potential to encode sequence information. As a control, identical fold-recognition experiments were performed on the same database of proteins using DSSP secondary structure information in the scoring matrix, instead of COREX residue stability information. The comparable performance of both approaches suggested that COREX residue stability information and secondary structure information could be of equivalent utility in more sophisticated fold-recognition techniques. The results of this work are a consequence of the idea that amino acid sequences fold not into single, rigidly stable structures but rather into thermodynamic ensembles best represented by a time-averaged structure.

SUBMITTER: Wrabl JO 

PROVIDER: S-EPMC2374190 | biostudies-literature | 2001 May

REPOSITORIES: biostudies-literature

altmetric image

Publications

Thermodynamic propensities of amino acids in the native state ensemble: implications for fold recognition.

Wrabl J O JO   Larson S A SA   Hilser V J VJ  

Protein science : a publication of the Protein Society 20010501 5


An amino acid sequence, in the context of the solvent environment, contains all of the thermodynamic information necessary to encode a three-dimensional protein structure. To investigate the relationship between an amino acid sequence and its corresponding protein fold, a database of thermodynamic stability information was assembled that spanned 2951 residues from 44 nonhomologous proteins. This information was obtained using the COREX algorithm, which computes an ensemble-based description of t  ...[more]

Similar Datasets

| S-EPMC122186 | biostudies-literature
| S-EPMC3747235 | biostudies-literature
| S-EPMC3315449 | biostudies-literature
| S-EPMC2955036 | biostudies-literature
| S-EPMC3495713 | biostudies-literature
| S-EPMC8768454 | biostudies-literature
| S-EPMC6095142 | biostudies-other
| S-EPMC5425102 | biostudies-literature
| S-EPMC2280067 | biostudies-literature