
Dataset Information


MemBrain: improving the accuracy of predicting transmembrane helices.


ABSTRACT: Prediction of transmembrane helices (TMH) in alpha-helical membrane proteins provides valuable information about the protein topology when high-resolution structures are not available. Many predictors have been developed based on either amino acid hydrophobicity scales or pure statistical approaches. While these predictors perform reasonably well in identifying the number of TMHs in a protein, they are generally inaccurate in predicting the ends of TMHs, or TMHs of unusual length. To improve the accuracy of TMH detection, we developed a machine-learning-based predictor, MemBrain, which integrates a number of modern bioinformatics approaches including sequence representation by multiple sequence alignment matrix, the optimized evidence-theoretic K-nearest neighbor prediction algorithm, fusion of multiple prediction window sizes, and classification by dynamic threshold. MemBrain demonstrates an overall improvement of about 20% in prediction accuracy, particularly in predicting the ends of TMHs and TMHs that are shorter than 15 residues. It also has the capability to detect N-terminal signal peptides. The MemBrain predictor is a useful sequence-based analysis tool for functional and structural characterization of helical membrane proteins; it is freely available at http://chou.med.harvard.edu/bioinf/MemBrain/.
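
As a rough illustration of two ideas named in the abstract (fusing several prediction window sizes, then thresholding per-residue scores into helix segments), the following is a minimal Python sketch. It is not the MemBrain method: the per-residue score here is simply the mean Kyte-Doolittle hydropathy in a sliding window, standing in for the evidence-theoretic K-nearest neighbor classifier, and the threshold is fixed rather than dynamic. The function names, the window sizes (7, 11, 15), the threshold of 1.6, and the minimum segment length are all assumptions chosen for illustration.

```python
from typing import Dict, List, Sequence, Tuple

# Kyte-Doolittle hydropathy scale (a stand-in scoring function, not MemBrain's).
KYTE_DOOLITTLE: Dict[str, float] = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def window_scores(seq: str, window: int) -> List[float]:
    """Mean hydropathy in a window centered on each residue.

    The window is truncated at the sequence ends; unknown residues score 0.
    """
    half = window // 2
    scores = []
    for i in range(len(seq)):
        lo, hi = max(0, i - half), min(len(seq), i + half + 1)
        vals = [KYTE_DOOLITTLE.get(aa, 0.0) for aa in seq[lo:hi]]
        scores.append(sum(vals) / len(vals))
    return scores


def fuse_windows(seq: str, windows: Sequence[int] = (7, 11, 15)) -> List[float]:
    """Fuse several window sizes by averaging their per-residue scores."""
    per_window = [window_scores(seq, w) for w in windows]
    return [sum(col) / len(col) for col in zip(*per_window)]


def call_segments(
    scores: Sequence[float], threshold: float = 1.6, min_len: int = 5
) -> List[Tuple[int, int]]:
    """Return half-open (start, end) runs where the score stays >= threshold.

    A fixed threshold is assumed here; runs shorter than min_len are dropped.
    """
    segments = []
    start = None
    for i, s in enumerate(scores):
        if s >= threshold and start is None:
            start = i
        elif s < threshold and start is not None:
            if i - start >= min_len:
                segments.append((start, i))
            start = None
    if start is not None and len(scores) - start >= min_len:
        segments.append((start, len(scores)))
    return segments


if __name__ == "__main__":
    # Toy sequence: a 20-residue leucine stretch flanked by charged residues.
    toy = "D" * 10 + "L" * 20 + "K" * 10
    print(call_segments(fuse_windows(toy)))
```

Even on this toy input, the called segment is shorter than the true hydrophobic stretch because window averaging blurs the boundaries, which gives a feel for why the abstract singles out helix ends as the hard part of the problem.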

SUBMITTER: Shen H 

PROVIDER: S-EPMC2396505 | biostudies-literature | 2008

REPOSITORIES: biostudies-literature


Publications

MemBrain: improving the accuracy of predicting transmembrane helices.

Shen Hongbin, Chou James J

PLoS ONE, 2008-06-11, issue 6



Similar Datasets

| S-EPMC2143072 | biostudies-other
| S-EPMC2242586 | biostudies-literature
| S-EPMC5073023 | biostudies-literature
| S-EPMC3025558 | biostudies-literature
| S-EPMC3053181 | biostudies-literature
| S-EPMC3608176 | biostudies-other
| S-EPMC6277224 | biostudies-literature
| S-EPMC3361389 | biostudies-literature
| S-EPMC5516182 | biostudies-literature
| S-EPMC3155016 | biostudies-literature