Unknown

Dataset Information

0

Predicting residue-residue contacts and helix-helix interactions in transmembrane proteins using an integrative feature-based random forest approach.


ABSTRACT: Integral membrane proteins constitute 25-30% of genomes and play crucial roles in many biological processes. However, less than 1% of membrane protein structures are in the Protein Data Bank. In this context, it is important to develop reliable computational methods for predicting the structures of membrane proteins. Here, we present the first application of random forest (RF) for residue-residue contact prediction in transmembrane proteins, which we term as TMhhcp. Rigorous cross-validation tests indicate that the built RF models provide a more favorable prediction performance compared with two state-of-the-art methods, i.e., TMHcon and MEMPACK. Using a strict leave-one-protein-out jackknifing procedure, they were capable of reaching the top L/5 prediction accuracies of 49.5% and 48.8% for two different residue contact definitions, respectively. The predicted residue contacts were further employed to predict interacting helical pairs and achieved the Matthew's correlation coefficients of 0.430 and 0.424, according to two different residue contact definitions, respectively. To facilitate the academic community, the TMhhcp server has been made freely accessible at http://protein.cau.edu.cn/tmhhcp.

SUBMITTER: Wang XF 

PROVIDER: S-EPMC3203928 | biostudies-literature | 2011

REPOSITORIES: biostudies-literature

altmetric image

Publications

Predicting residue-residue contacts and helix-helix interactions in transmembrane proteins using an integrative feature-based random forest approach.

Wang Xiao-Feng XF   Chen Zhen Z   Wang Chuan C   Yan Ren-Xiang RX   Zhang Ziding Z   Song Jiangning J  

PloS one 20111028 10


Integral membrane proteins constitute 25-30% of genomes and play crucial roles in many biological processes. However, less than 1% of membrane protein structures are in the Protein Data Bank. In this context, it is important to develop reliable computational methods for predicting the structures of membrane proteins. Here, we present the first application of random forest (RF) for residue-residue contact prediction in transmembrane proteins, which we term as TMhhcp. Rigorous cross-validation tes  ...[more]

Similar Datasets

| S-EPMC2666818 | biostudies-other
| S-EPMC2841610 | biostudies-literature
| S-EPMC6419322 | biostudies-literature
| S-EPMC4206426 | biostudies-literature
| S-EPMC1538872 | biostudies-literature
| S-EPMC8624843 | biostudies-literature
| S-EPMC3509494 | biostudies-literature
| S-EPMC2900627 | biostudies-literature