Unknown

Dataset Information

0

MetaPSICOV: combining coevolution methods for accurate prediction of contacts and long range hydrogen bonding in proteins.


ABSTRACT: MOTIVATION: Recent developments of statistical techniques to infer direct evolutionary couplings between residue pairs have rendered covariation-based contact prediction a viable means for accurate 3D modelling of proteins, with no information other than the sequence required. To extend the usefulness of contact prediction, we have designed a new meta-predictor (MetaPSICOV) which combines three distinct approaches for inferring covariation signals from multiple sequence alignments, considers a broad range of other sequence-derived features and, uniquely, a range of metrics which describe both the local and global quality of the input multiple sequence alignment. Finally, we use a two-stage predictor, where the second stage filters the output of the first stage. This two-stage predictor is additionally evaluated on its ability to accurately predict the long range network of hydrogen bonds, including correctly assigning the donor and acceptor residues. RESULTS: Using the original PSICOV benchmark set of 150 protein families, MetaPSICOV achieves a mean precision of 0.54 for top-L predicted long range contacts-around 60% higher than PSICOV, and around 40% better than CCMpred. In de novo protein structure prediction using FRAGFOLD, MetaPSICOV is able to improve the TM-scores of models by a median of 0.05 compared with PSICOV. Lastly, for predicting long range hydrogen bonding, MetaPSICOV-HB achieves a precision of 0.69 for the top-L/10 hydrogen bonds compared with just 0.26 for the baseline MetaPSICOV. AVAILABILITY AND IMPLEMENTATION: MetaPSICOV is available as a freely available web server at http://bioinf.cs.ucl.ac.uk/MetaPSICOV. Raw data (predicted contact lists and 3D models) and source code can be downloaded from http://bioinf.cs.ucl.ac.uk/downloads/MetaPSICOV. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

SUBMITTER: Jones DT 

PROVIDER: S-EPMC4382908 | biostudies-other | 2015 Apr

REPOSITORIES: biostudies-other

altmetric image

Publications

MetaPSICOV: combining coevolution methods for accurate prediction of contacts and long range hydrogen bonding in proteins.

Jones David T DT   Singh Tanya T   Kosciolek Tomasz T   Tetchner Stuart S  

Bioinformatics (Oxford, England) 20141126 7


<h4>Motivation</h4>Recent developments of statistical techniques to infer direct evolutionary couplings between residue pairs have rendered covariation-based contact prediction a viable means for accurate 3D modelling of proteins, with no information other than the sequence required. To extend the usefulness of contact prediction, we have designed a new meta-predictor (MetaPSICOV) which combines three distinct approaches for inferring covariation signals from multiple sequence alignments, consid  ...[more]

Similar Datasets

| S-EPMC4894841 | biostudies-literature
| S-EPMC6442609 | biostudies-literature
| S-EPMC6057941 | biostudies-literature
| S-EPMC2873825 | biostudies-literature
| S-EPMC4145820 | biostudies-literature
| S-EPMC4615339 | biostudies-other
| S-EPMC8425427 | biostudies-literature
| S-EPMC4390092 | biostudies-literature
| S-EPMC6149874 | biostudies-literature
| S-EPMC3042383 | biostudies-literature