Unknown

Dataset Information

0

Evolutionary inaccuracy of pairwise structural alignments.


ABSTRACT: Structural alignment methods are widely used to generate gold standard alignments for improving multiple sequence alignments and transferring functional annotations, as well as for assigning structural distances between proteins. However, the correctness of the alignments generated by these methods is difficult to assess objectively since little is known about the exact evolutionary history of most proteins. Since homology is an equivalence relation, an upper bound on alignment quality can be found by assessing the consistency of alignments. Measuring the consistency of current methods of structure alignment and determining the causes of inconsistencies can, therefore, provide information on the quality of current methods and suggest possibilities for further improvement.We analyze the self-consistency of seven widely-used structural alignment methods (SAP, TM-align, Fr-TM-align, MAMMOTH, DALI, CE and FATCAT) on a diverse, non-redundant set of 1863 domains from the SCOP database and demonstrate that even for relatively similar proteins the degree of inconsistency of the alignments on a residue level is high (30%). We further show that levels of consistency vary substantially between methods, with two methods (SAP and Fr-TM-align) producing more consistent alignments than the rest. Inconsistency is found to be higher near gaps and for proteins of low structural complexity, as well as for helices. The ability of the methods to identify good structural alignments is also assessed using geometric measures, for which FATCAT (flexible mode) is found to be the best performer despite being highly inconsistent. We conclude that there is substantial scope for improving the consistency of structural alignment methods.msadows@nimr.mrc.ac.ukSupplementary data are available at Bioinformatics online.

SUBMITTER: Sadowski MI 

PROVIDER: S-EPMC3338010 | biostudies-literature | 2012 May

REPOSITORIES: biostudies-literature

altmetric image

Publications

Evolutionary inaccuracy of pairwise structural alignments.

Sadowski M I MI   Taylor W R WR  

Bioinformatics (Oxford, England) 20120306 9


<h4>Motivation</h4>Structural alignment methods are widely used to generate gold standard alignments for improving multiple sequence alignments and transferring functional annotations, as well as for assigning structural distances between proteins. However, the correctness of the alignments generated by these methods is difficult to assess objectively since little is known about the exact evolutionary history of most proteins. Since homology is an equivalence relation, an upper bound on alignmen  ...[more]

Similar Datasets

| S-EPMC2014794 | biostudies-literature
| S-EPMC2253518 | biostudies-literature
| S-EPMC3394275 | biostudies-literature
| S-EPMC4739816 | biostudies-literature
| S-EPMC2850363 | biostudies-literature
| S-EPMC7660437 | biostudies-literature
| S-EPMC4597059 | biostudies-literature
| S-EPMC8087094 | biostudies-literature
| S-EPMC3287116 | biostudies-literature
| S-EPMC4748600 | biostudies-literature