Dataset Information

Multiple structure alignment and consensus identification for proteins.

ABSTRACT:

Background

An algorithm is presented to compute a multiple structure alignment for a set of proteins and to generate a consensus (pseudo) protein which captures common substructures present in the given proteins. The algorithm represents each protein as a sequence of triples of coordinates of the alpha-carbon atoms along the backbone. It then computes iteratively a sequence of transformation matrices (i.e., translations and rotations) to align the proteins in space and generate the consensus. The algorithm is a heuristic in that it computes an approximation to the optimal alignment that minimizes the sum of the pairwise distances between the consensus and the transformed proteins.

Results

Experimental results show that the algorithm converges quite rapidly and generates consensus structures that are visually similar to the input proteins. A comparison with other coordinate-based alignment algorithms (MAMMOTH and MATT) shows that the proposed algorithm is competitive in terms of speed and the sizes of the conserved regions discovered in an extensive benchmark dataset derived from the HOMSTRAD and SABmark databases. The algorithm has been implemented in C++ and can be downloaded from the project's web page. Alternatively, the algorithm can be used via a web server which makes it possible to align protein structures by uploading files from local disk or by downloading protein data from the RCSB Protein Data Bank.

Conclusions

An algorithm is presented to compute a multiple structure alignment for a set of proteins, together with their consensus structure. Experimental results show its effectiveness in terms of the quality of the alignment and computational cost.

SUBMITTER: Ilinkin I

PROVIDER: S-EPMC2829528 | biostudies-literature | 2010 Feb

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

Multiple structure alignment and consensus identification for proteins.

Ilinkin Ivaylo I Ye Jieping J Janardan Ravi R

BMC bioinformatics 20100202

<h4>Background</h4>An algorithm is presented to compute a multiple structure alignment for a set of proteins and to generate a consensus (pseudo) protein which captures common substructures present in the given proteins. The algorithm represents each protein as a sequence of triples of coordinates of the alpha-carbon atoms along the backbone. It then computes iteratively a sequence of transformation matrices (i.e., translations and rotations) to align the proteins in space and generate the conse ...[more]

PMID: 20122279

Dataset Information

Multiple structure alignment and consensus identification for proteins.

Background

Results

Conclusions

Publications

Multiple structure alignment and consensus identification for proteins.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

Structure alignment of membrane proteins: Accuracy of available tools and a consensus strategy.
| S-EPMC4545697 | biostudies-literature

Structure alignment of membrane proteins: Accuracy of available tools and a consensus strategy.
| S-EPMC5073787 | biostudies-literature

Multiple protein structure alignment.
| S-EPMC2142613 | biostudies-other

Multiple structure alignment with msTALI.
| S-EPMC3473313 | biostudies-literature

MergeAlign: improving multiple sequence alignment performance by dynamic reconstruction of consensus multiple sequence alignments.
| S-EPMC3413523 | biostudies-literature

STRALCP--structure alignment-based clustering of proteins.
| S-EPMC2190701 | biostudies-literature

Proteins comparison through probabilistic optimal structure local alignment.
| S-EPMC4151033 | biostudies-literature

Structure-Based Alignment and Consensus Secondary Structures for Three HIV-Related RNA Genomes.
| S-EPMC4439019 | biostudies-literature

A data-mining approach for multiple structural alignment of proteins.
| S-EPMC2951672 | biostudies-literature

Accurate multiple sequence alignment of transmembrane proteins with PSI-Coffee.
| S-EPMC3303701 | biostudies-literature