Unknown

Dataset Information

0

Accurate multiple sequence alignment of transmembrane proteins with PSI-Coffee.


ABSTRACT:

Background

Transmembrane proteins (TMPs) constitute about 20~30% of all protein coding genes. The relative lack of experimental structure has so far made it hard to develop specific alignment methods and the current state of the art (PRALINE™) only manages to recapitulate 50% of the positions in the reference alignments available from the BAliBASE2-ref7.

Methods

We show how homology extension can be adapted and combined with a consistency based approach in order to significantly improve the multiple sequence alignment of alpha-helical TMPs. TM-Coffee is a special mode of PSI-Coffee able to efficiently align TMPs, while using a reduced reference database for homology extension.

Results

Our benchmarking on BAliBASE2-ref7 alpha-helical TMPs shows a significant improvement over the most accurate methods such as MSAProbs, Kalign, PROMALS, MAFFT, ProbCons and PRALINE™. We also estimated the influence of the database used for homology extension and show that highly non-redundant UniRef databases can be used to obtain similar results at a significantly reduced computational cost over full protein databases. TM-Coffee is part of the T-Coffee package, a web server is also available from http://tcoffee.crg.cat/tmcoffee and a freeware open source code can be downloaded from http://www.tcoffee.org/Packages/Stable/Latest.

SUBMITTER: Chang JM 

PROVIDER: S-EPMC3303701 | biostudies-literature | 2012 Mar

REPOSITORIES: biostudies-literature

altmetric image

Publications

Accurate multiple sequence alignment of transmembrane proteins with PSI-Coffee.

Chang Jia-Ming JM   Di Tommaso Paolo P   Taly Jean-François JF   Notredame Cedric C  

BMC bioinformatics 20120328


<h4>Background</h4>Transmembrane proteins (TMPs) constitute about 20~30% of all protein coding genes. The relative lack of experimental structure has so far made it hard to develop specific alignment methods and the current state of the art (PRALINE™) only manages to recapitulate 50% of the positions in the reference alignments available from the BAliBASE2-ref7.<h4>Methods</h4>We show how homology extension can be adapted and combined with a consistency based approach in order to significantly i  ...[more]

Similar Datasets

| S-EPMC4987888 | biostudies-literature
| S-EPMC5624947 | biostudies-literature
| S-EPMC5037421 | biostudies-literature
| S-EPMC1955456 | biostudies-literature
| S-EPMC3389763 | biostudies-literature
| S-EPMC7735675 | biostudies-literature
| S-EPMC7328376 | biostudies-literature
| S-EPMC3799466 | biostudies-literature
| S-EPMC1187875 | biostudies-literature
| S-EPMC6657586 | biostudies-literature