Protein stability: a single recorded mutation aids in predicting the effects of other mutations in the same amino acid site.
Ontology highlight
ABSTRACT: MOTIVATION: Accurate prediction of protein stability is important for understanding the molecular underpinnings of diseases and for the design of new proteins. We introduce a novel approach for the prediction of changes in protein stability that arise from a single-site amino acid substitution; the approach uses available data on mutations occurring in the same position and in other positions. Our algorithm, named Pro-Maya (Protein Mutant stAbilitY Analyzer), combines a collaborative filtering baseline model, Random Forests regression and a diverse set of features. Pro-Maya predicts the stability free energy difference of mutant versus wild type, denoted as ??G. RESULTS: We evaluated our algorithm extensively using cross-validation on two previously utilized datasets of single amino acid mutations and a (third) validation set. The results indicate that using known ??G values of mutations at the query position improves the accuracy of ??G predictions for other mutations in that position. The accuracy of our predictions in such cases significantly surpasses that of similar methods, achieving, e.g. a Pearson's correlation coefficient of 0.79 and a root mean square error of 0.96 on the validation set. Because Pro-Maya uses a diverse set of features, including predictions using two other methods, it also performs slightly better than other methods in the absence of additional experimental data on the query positions. AVAILABILITY: Pro-Maya is freely available via web server at http://bental.tau.ac.il/ProMaya. CONTACT: nirb@tauex.tau.ac.il; wolf@cs.tau.ac.il SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
SUBMITTER: Wainreb G
PROVIDER: S-EPMC3223369 | biostudies-literature | 2011 Dec
REPOSITORIES: biostudies-literature
ACCESS DATA