Dataset Information

Membrane protein contact and structure prediction using co-evolution in conjunction with machine learning.


ABSTRACT: De novo membrane protein structure prediction is limited to small proteins because the conformational search space expands quickly with length. Long-range contacts (residue positions separated by 24 or more amino acids in sequence but in close proximity in the structure) are arguably the most effective way to restrict this conformational space. Inverse methods for co-evolutionary analysis predict a global set of position-pair couplings that best explain the observed amino acid co-occurrences, thus distinguishing between evolutionarily explained co-variances and those arising from spurious transitive effects. Here, we show that applying machine learning approaches and custom descriptors improves evolutionary contact prediction accuracy, raising average precision by 6 percentage points for the top 1L non-local contacts. Further, we demonstrate that predicted contacts improve protein folding with BCL::Fold: the mean RMSD100 of the top 10 folded models was reduced by an average of 2 Å for a benchmark of 25 membrane proteins.
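The "precision for the top 1L contacts" figure quoted in the abstract can be illustrated with a minimal sketch. This is not the authors' evaluation code. The function name `top_l_precision`, its parameters, and the toy data are hypothetical. The default minimum sequence separation of 24 is taken from the abstract's definition of long-range contacts; the threshold the paper uses for "non-local" contacts is not stated in this record, so it is left as a parameter.

```python
from typing import Iterable, Set, Tuple

Pair = Tuple[int, int]


def top_l_precision(
    scored_pairs: Iterable[Tuple[int, int, float]],
    true_contacts: Set[Pair],
    length: int,
    min_separation: int = 24,  # assumption: the abstract's long-range cutoff
    factor: float = 1.0,       # 1.0 corresponds to "top 1L"
) -> float:
    """Fraction of the top (factor * length) predicted pairs that are true contacts."""
    # Store every pair as (smaller index, larger index) so lookups are order-independent.
    truth = {(min(i, j), max(i, j)) for i, j in true_contacts}

    # Keep only pairs that are far enough apart in sequence.
    candidates = [
        (score, (min(i, j), max(i, j)))
        for i, j, score in scored_pairs
        if abs(i - j) >= min_separation
    ]

    # Highest-scoring predictions first, then cut the list at factor * L.
    candidates.sort(key=lambda item: item[0], reverse=True)
    top = candidates[: int(factor * length)]
    if not top:
        return 0.0

    hits = sum(1 for _, pair in top if pair in truth)
    # Design choice: divide by the number of pairs actually evaluated,
    # which can be smaller than factor * L for short proteins.
    return hits / len(top)


# Toy data: a 4-residue "protein" with a separation cutoff of 2.
toy_pairs = [(0, 3, 0.9), (0, 2, 0.8), (1, 3, 0.7), (0, 1, 0.95), (1, 2, 0.5)]
toy_truth = {(0, 3), (3, 1)}
toy_precision = top_l_precision(toy_pairs, toy_truth, length=4, min_separation=2)
```

In the toy example, three pairs pass the separation filter and two of them are true contacts, so `toy_precision` is 2/3.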

SUBMITTER: Teixeira PL 

PROVIDER: S-EPMC5443516 | biostudies-literature | 2017

REPOSITORIES: biostudies-literature

Publications

Membrane protein contact and structure prediction using co-evolution in conjunction with machine learning.

Pedro L Teixeira, Jeff L Mendenhall, Sten Heinze, Brian Weiner, Marcin J Skwark, Jens Meiler

PLoS One, 2017-05-24, issue 5


Similar Datasets

2013-01-01 | E-GEOD-29210 | biostudies-arrayexpress
2013-01-01 | GSE29210 | GEO
| S-EPMC8199773 | biostudies-literature
| S-EPMC8340610 | biostudies-literature
| S-EPMC9307832 | biostudies-literature
| S-EPMC9281391 | biostudies-literature
| S-EPMC6241126 | biostudies-other
| S-ECPF-GEOD-29210 | biostudies-other
| S-EPMC8100175 | biostudies-literature