Unknown

Dataset Information

0

Using inferred residue contacts to distinguish between correct and incorrect protein models.


ABSTRACT:

Motivation

The de novo prediction of 3D protein structure is enjoying a period of dramatic improvements. Often, a remaining difficulty is to select the model closest to the true structure from a group of low-energy candidates. To what extent can inter-residue contact predictions from multiple sequence alignments, information which is orthogonal to that used in most structure prediction algorithms, be used to identify those models most similar to the native protein structure?

Results

We present a Bayesian inference procedure to identify residue pairs that are spatially proximal in a protein structure. The method takes as input a multiple sequence alignment, and outputs an accurate posterior probability of proximity for each residue pair. We exploit a recent metagenomic sequencing project to create large, diverse and informative multiple sequence alignments for a test set of 1656 known protein structures. The method infers spatially proximal residue pairs in this test set with good accuracy: top-ranked predictions achieve an average accuracy of 38% (for an average 21-fold improvement over random predictions) in cross-validation tests. Notably, the accuracy of predicted 3D models generated by a range of structure prediction algorithms strongly correlates with how well the models satisfy probable residue contacts inferred via our method. This correlation allows for confident rejection of incorrect structural models.

Availability

An implementation of the method is freely available at http://www.doe-mbi.ucla.edu/services.

SUBMITTER: Miller CS 

PROVIDER: S-EPMC2638260 | biostudies-literature | 2008 Jul

REPOSITORIES: biostudies-literature

altmetric image

Publications

Using inferred residue contacts to distinguish between correct and incorrect protein models.

Miller Christopher S CS   Eisenberg David D  

Bioinformatics (Oxford, England) 20080529 14


<h4>Motivation</h4>The de novo prediction of 3D protein structure is enjoying a period of dramatic improvements. Often, a remaining difficulty is to select the model closest to the true structure from a group of low-energy candidates. To what extent can inter-residue contact predictions from multiple sequence alignments, information which is orthogonal to that used in most structure prediction algorithms, be used to identify those models most similar to the native protein structure?<h4>Results</  ...[more]

Similar Datasets

| S-EPMC5628397 | biostudies-literature
| S-EPMC5799025 | biostudies-literature
| S-EPMC9923443 | biostudies-literature
| S-EPMC3509494 | biostudies-literature
| S-EPMC2677742 | biostudies-literature
| S-EPMC6419322 | biostudies-literature
| S-EPMC6332208 | biostudies-literature
| S-EPMC4908341 | biostudies-literature
| S-EPMC4894841 | biostudies-literature
| S-EPMC3324774 | biostudies-literature