Dataset Information

Forecasting residue-residue contact prediction accuracy.

ABSTRACT:

Motivation

Apart from meta-predictors, most of today's methods for residue-residue contact prediction are based entirely on Direct Coupling Analysis (DCA) of correlated mutations in multiple sequence alignments (MSAs). These methods are on average ∼40% correct for the 100 strongest predicted contacts in each protein. An end-user working on a single protein of interest cannot know whether the predictions for that protein are much more or much less accurate than 40%, which is especially problematic when the predicted contacts are used to steer experimental research on that protein.
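The "percent correct for the 100 strongest predicted contacts" figure can be read as a top-N precision. The following is a minimal sketch of that calculation, not the authors' script: the function name `top_n_precision`, the input layout (pairs of residue indices with a score) and the toy numbers are all assumptions made for illustration.

```python
def top_n_precision(scored_pairs, true_contacts, n=100):
    """Fraction of the n highest-scoring predicted pairs that are true contacts.

    scored_pairs: iterable of ((i, j), score) tuples (hypothetical layout).
    true_contacts: iterable of (i, j) residue index pairs from the known structure.
    """
    # Rank predictions by score, strongest first, and keep the top n.
    ranked = sorted(scored_pairs, key=lambda item: item[1], reverse=True)[:n]
    if not ranked:
        return 0.0
    # Normalize pair order so that (i, j) and (j, i) compare equal.
    truth = {tuple(sorted(pair)) for pair in true_contacts}
    hits = sum(1 for pair, _score in ranked if tuple(sorted(pair)) in truth)
    return hits / len(ranked)


# Toy, made-up data: five scored pairs, two of which are real contacts.
predicted = [((1, 10), 0.9), ((2, 20), 0.8), ((3, 30), 0.4),
             ((4, 40), 0.2), ((5, 50), 0.1)]
native = [(10, 1), (3, 30)]
precision = top_n_precision(predicted, native, n=4)
```

With `n=100` and real DCA scores, this is the kind of per-protein value that the abstract reports as averaging roughly 40%.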

Results

We designed a regression model that forecasts the accuracy of residue-residue contact prediction for individual proteins with an average error of 7 percentage points. Contacts were predicted with two DCA methods (gplmDCA and PSICOV). The models were built on parameters that describe the MSA, the predicted secondary structure, the predicted solvent accessibility and the contact prediction scores for the target protein. The results show that our models can also be applied to meta-methods, as tested on RaptorX.
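To make the "average error of 7 percentage points" concrete, here is a small sketch of how such an error could be computed from forecast and observed per-protein accuracies. It assumes the error is a mean absolute difference, which the abstract does not state explicitly; the function name `mean_abs_error_pp` and the numbers are made up for illustration and are not results from the paper.

```python
def mean_abs_error_pp(forecast_pct, observed_pct):
    """Mean absolute difference, in percentage points, between two lists of accuracies."""
    if len(forecast_pct) != len(observed_pct) or not forecast_pct:
        raise ValueError("need two non-empty lists of equal length")
    diffs = [abs(f - o) for f, o in zip(forecast_pct, observed_pct)]
    return sum(diffs) / len(diffs)


# Made-up accuracies (in percent) for three hypothetical proteins.
forecast = [40.0, 55.0, 20.0]   # what a regression model might forecast
observed = [45.0, 50.0, 30.0]   # what the contact predictions actually achieved
error_pp = mean_abs_error_pp(forecast, observed)
```

Note that the unit is percentage points: a forecast of 40% against an observed 45% counts as an error of 5, not 12.5%.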

Availability and implementation

All data and scripts are available from http://comprec-lin.iiar.pwr.edu.pl/dcaQ/.

Contact

malgorzata.kotulska@pwr.edu.pl.

Supplementary information

Supplementary data are available at Bioinformatics online.

SUBMITTER: Wozniak PP

PROVIDER: S-EPMC5860164 | biostudies-literature | 2017 Nov

REPOSITORIES: biostudies-literature

Publications

Forecasting residue-residue contact prediction accuracy.

Wozniak PP, Konopka BM, Xu J, Vriend G, Kotulska M

Bioinformatics (Oxford, England), 2017 Nov 1; issue 21

PMID: 29036497

Similar Datasets

COMTOP: Protein Residue-Residue Contact Prediction through Mixed Integer Linear Optimization. | S-EPMC8305966 | biostudies-literature

Synthetic protein alignments by CCMgen quantify noise in residue-residue contact prediction. | S-EPMC6237422 | biostudies-literature

RRCRank: a fusion method using rank strategy for residue-residue contact prediction. | S-EPMC5581475 | biostudies-literature

Characteristics of protein residue-residue contacts and their application in contact prediction. | S-EPMC4221654 | biostudies-literature

Prediction of inter-residue contact clusters from hydrophobic cores. | S-EPMC2929137 | biostudies-literature

Enhanced Inter-helical Residue Contact Prediction in Transmembrane Proteins. | S-EPMC3164537 | biostudies-literature

DeepCDpred: Inter-residue distance and contact prediction for improved prediction of protein structure. | S-EPMC6324825 | biostudies-literature

Towards accurate residue-residue hydrophobic contact prediction for alpha helical proteins via integer linear optimization. | S-EPMC2635923 | biostudies-literature

OMPcontact: An Outer Membrane Protein Inter-Barrel Residue Contact Prediction Method. | S-EPMC5346958 | biostudies-literature

Residue contact-count potentials are as effective as residue-residue contact-type potentials for ranking protein decoys. | S-EPMC2642821 | biostudies-literature