Unknown

Dataset Information

0

Improving predicted protein loop structure ranking using a Pareto-optimality consensus method.


ABSTRACT: Accurate protein loop structure models are important to understand functions of many proteins. Identifying the native or near-native models by distinguishing them from the misfolded ones is a critical step in protein loop structure prediction.We have developed a Pareto Optimal Consensus (POC) method, which is a consensus model ranking approach to integrate multiple knowledge- or physics-based scoring functions. The procedure of identifying the models of best quality in a model set includes: 1) identifying the models at the Pareto optimal front with respect to a set of scoring functions, and 2) ranking them based on the fuzzy dominance relationship to the rest of the models. We apply the POC method to a large number of decoy sets for loops of 4- to 12-residue in length using a functional space composed of several carefully-selected scoring functions: Rosetta, DOPE, DDFIRE, OPLS-AA, and a triplet backbone dihedral potential developed in our lab. Our computational results show that the sets of Pareto-optimal decoys, which are typically composed of approximately 20% or less of the overall decoys in a set, have a good coverage of the best or near-best decoys in more than 99% of the loop targets. Compared to the individual scoring function yielding best selection accuracy in the decoy sets, the POC method yields 23%, 37%, and 64% less false positives in distinguishing the native conformation, indentifying a near-native model (RMSD < 0.5A from the native) as top-ranked, and selecting at least one near-native model in the top-5-ranked models, respectively. Similar effectiveness of the POC method is also found in the decoy sets from membrane protein loops. Furthermore, the POC method outperforms the other popularly-used consensus strategies in model ranking, such as rank-by-number, rank-by-rank, rank-by-vote, and regression-based methods.By integrating multiple knowledge- and physics-based scoring functions based on Pareto optimality and fuzzy dominance, the POC method is effective in distinguishing the best loop models from the other ones within a loop model set.

SUBMITTER: Li Y 

PROVIDER: S-EPMC2914074 | biostudies-literature | 2010 Jul

REPOSITORIES: biostudies-literature

altmetric image

Publications

Improving predicted protein loop structure ranking using a Pareto-optimality consensus method.

Li Yaohang Y   Rata Ionel I   Chiu See-wing SW   Jakobsson Eric E  

BMC structural biology 20100720


<h4>Background</h4>Accurate protein loop structure models are important to understand functions of many proteins. Identifying the native or near-native models by distinguishing them from the misfolded ones is a critical step in protein loop structure prediction.<h4>Results</h4>We have developed a Pareto Optimal Consensus (POC) method, which is a consensus model ranking approach to integrate multiple knowledge- or physics-based scoring functions. The procedure of identifying the models of best qu  ...[more]

Similar Datasets

| S-EPMC4150305 | biostudies-literature
| S-EPMC6384925 | biostudies-literature
| S-EPMC4605829 | biostudies-literature
| S-EPMC2662860 | biostudies-literature
| S-EPMC8478077 | biostudies-literature
| S-EPMC8413283 | biostudies-literature
| S-EPMC8163773 | biostudies-literature
| S-EPMC3614904 | biostudies-literature
| S-EPMC8595106 | biostudies-literature
| S-EPMC2142642 | biostudies-other