Dataset Information

Selecting Near-Native Protein Structures from Predicted Decoy Sets Using Ordered Graphlet Degree Similarity.

ABSTRACT: Effective prediction of protein tertiary structure from sequence is an important and challenging problem in computational structural biology. Ab initio protein structure prediction relies on the amino acid sequence alone and therefore has a wide range of applications. The ab initio method can predict a large number of candidate protein structures, called a decoy set; however, selecting a good near-native structure from the predicted decoy set is difficult. In this work, we propose a new method for selecting the near-native structure from the decoy set based on both contact map overlap (CMO) and graphlets. By generalizing graphlets to ordered graphs and using dynamic programming to select the optimal alignment with an introduced gap penalty, a GR_score is defined for calculating the similarity between three-dimensional (3D) decoy structures. The proposed method was applied to all 54 single-domain targets in CASP11 and all 43 targets in CASP10, and ensemble clustering was used to cluster the protein decoy structures based on the computed GR_scores. The most popular centroid structure was selected as the near-native structure. The experiments showed that, compared with the SPICKER method used in I-TASSER, the proposed method can usually select better near-native structures in terms of the similarity between the selected structure and the true native structure.
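
The abstract combines two ingredients: a contact map per decoy, and a dynamic-programming alignment with a gap penalty over per-residue descriptors. The sketch below illustrates only that general shape and is not the authors' GR_score. It substitutes a plain contact degree for the ordered graphlet degree vectors, and the 8.0 Å cutoff, the gap value of 0.5, the per-residue similarity formula, and all function names are assumptions made for illustration.

```python
from math import dist


def contact_map(coords, cutoff=8.0):
    """Return residue index pairs (i, j), i < j, closer than `cutoff`.

    `coords` is a list of (x, y, z) tuples, one per residue (for example
    C-alpha atoms). The 8.0 cutoff is an assumed, commonly used value.
    """
    n = len(coords)
    return {
        (i, j)
        for i in range(n)
        for j in range(i + 1, n)
        if dist(coords[i], coords[j]) < cutoff
    }


def contact_degrees(contacts, n):
    """Count contacts per residue: a crude stand-in for a graphlet degree."""
    degrees = [0] * n
    for i, j in contacts:
        degrees[i] += 1
        degrees[j] += 1
    return degrees


def align_score(a, b, gap=0.5):
    """Global alignment score of two descriptor sequences.

    Needleman-Wunsch style dynamic programming with a linear gap penalty.
    The per-position similarity (assumed here) is 1 for equal values and
    decreases towards 0 as the values diverge.
    """
    n, m = len(a), len(b)
    dp = [[0.0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        dp[i][0] = -gap * i
    for j in range(1, m + 1):
        dp[0][j] = -gap * j
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            x, y = a[i - 1], b[j - 1]
            sim = 1.0 - abs(x - y) / max(x, y, 1)
            dp[i][j] = max(
                dp[i - 1][j - 1] + sim,  # align residue i with residue j
                dp[i - 1][j] - gap,      # gap in b
                dp[i][j - 1] - gap,      # gap in a
            )
    return dp[n][m]
```

Under these assumptions, two decoys could be compared by building `contact_degrees(contact_map(coords), len(coords))` for each and passing the two lists to `align_score`; a pairwise matrix of such scores would then feed whatever clustering step is used.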

SUBMITTER: Han X

PROVIDER: S-EPMC6410076 | biostudies-literature | 2019 Feb

REPOSITORIES: biostudies-literature

Publications

Selecting Near-Native Protein Structures from Predicted Decoy Sets Using Ordered Graphlet Degree Similarity.

Han Xu, Li Li, Lu Yonggang

Genes, 2019-02-11, issue 2

PMID: 30754721

Similar Datasets

Density-based score for selecting near-native atomic models of unknown structures.
| S-EPMC2175034 | biostudies-literature

Detecting local residue environment similarity for recognizing near-native structure models.
| S-EPMC4237674 | biostudies-literature

Methods of model accuracy estimation can help selecting the best models from decoy sets: Assessment of model accuracy estimations in CASP11.
| S-EPMC4781682 | biostudies-literature

Using graphlet degree vectors to predict atomic displacement parameters in protein structures.
| S-EPMC10833351 | biostudies-literature

Splitting statistical potentials into meaningful scoring functions: testing the prediction of near-native structures from decoy conformations.
| S-EPMC2783033 | biostudies-literature

Model selection over partially ordered sets.
| S-EPMC10895251 | biostudies-literature

Eye lens β-crystallins are predicted by native ion mobility-mass spectrometry and computations to form compact higher-ordered heterooligomers.
| S-EPMC10528727 | biostudies-literature

A phylogenomics approach for selecting robust sets of phylogenetic markers.
| S-EPMC3985644 | biostudies-literature

Similarity searches in genome-wide numerical data sets.
| S-EPMC1489924 | biostudies-literature

Protein complex compositions predicted by structural similarity.
| S-EPMC1474056 | biostudies-literature