Unknown

Dataset Information

0

Prediction of protein folding class using global description of amino acid sequence.


ABSTRACT: We present a method for predicting protein folding class based on global protein chain description and a voting process. Selection of the best descriptors was achieved by a computer-simulated neural network trained on a data base consisting of 83 folding classes. Protein-chain descriptors include overall composition, transition, and distribution of amino acid attributes, such as relative hydrophobicity, predicted secondary structure, and predicted solvent exposure. Cross-validation testing was performed on 15 of the largest classes. The test shows that proteins were assigned to the correct class (correct positive prediction) with an average accuracy of 71.7%, whereas the inverse prediction of proteins as not belonging to a particular class (correct negative prediction) was 90-95% accurate. When tested on 254 structures used in this study, the top two predictions contained the correct class in 91% of the cases.

SUBMITTER: Dubchak I 

PROVIDER: S-EPMC41034 | biostudies-other | 1995 Sep

REPOSITORIES: biostudies-other

altmetric image

Publications

Prediction of protein folding class using global description of amino acid sequence.

Dubchak I I   Muchnik I I   Holbrook S R SR   Kim S H SH  

Proceedings of the National Academy of Sciences of the United States of America 19950901 19


We present a method for predicting protein folding class based on global protein chain description and a voting process. Selection of the best descriptors was achieved by a computer-simulated neural network trained on a data base consisting of 83 folding classes. Protein-chain descriptors include overall composition, transition, and distribution of amino acid attributes, such as relative hydrophobicity, predicted secondary structure, and predicted solvent exposure. Cross-validation testing was p  ...[more]

Similar Datasets

| S-EPMC1538837 | biostudies-literature
| S-EPMC428451 | biostudies-literature
| S-EPMC2441995 | biostudies-literature
| S-EPMC9106047 | biostudies-literature
| S-EPMC1933209 | biostudies-literature
| S-EPMC1277819 | biostudies-literature
| S-EPMC4047675 | biostudies-literature
| S-EPMC6486773 | biostudies-literature
| S-EPMC2975416 | biostudies-literature
| S-EPMC3245509 | biostudies-literature