Dataset Information

Instability in progressive multiple sequence alignment algorithms.


ABSTRACT: BACKGROUND: Progressive alignment is the standard approach used to align large numbers of sequences. As with all heuristics, this involves a tradeoff between alignment accuracy and computation time. RESULTS: We examine this tradeoff and find that, because of a loss of information in the early steps of the approach, the alignments generated by the most common multiple sequence alignment programs are inherently unstable, and simply reversing the order of the sequences in the input file will cause a different alignment to be generated. Although this effect is more obvious with larger numbers of sequences, it can also be seen with data sets in the order of one hundred sequences. We also outline the means to determine the number of sequences in a data set beyond which the probability of instability will become more pronounced. CONCLUSIONS: This has major ramifications for both the designers of large-scale multiple sequence alignment algorithms, and for the users of these alignments.
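
The order-reversal check described in the abstract can be sketched in a few lines of Python. This is only an illustrative sketch, not the authors' code: the helper names `parse_fasta`, `reverse_fasta`, and `same_alignment` are hypothetical, the input is assumed to be FASTA text with unique sequence names, and two alignments are treated as identical only when every sequence has exactly the same gapped row.

```python
def parse_fasta(text):
    """Return a list of (name, sequence) tuples in file order."""
    records = []
    name, chunks = None, []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # A new header closes the previous record, if there was one.
            if name is not None:
                records.append((name, "".join(chunks)))
            name, chunks = line[1:], []
        elif name is not None:
            chunks.append(line)
    if name is not None:
        records.append((name, "".join(chunks)))
    return records


def reverse_fasta(text):
    """Return FASTA text holding the same records in reversed order."""
    return "".join(f">{n}\n{s}\n" for n, s in reversed(parse_fasta(text)))


def same_alignment(aligned_a, aligned_b):
    """True if both aligned FASTA texts give each sequence the same gapped row.

    Row order is ignored, so only differences in gap placement count.
    Assumes sequence names are unique within each file.
    """
    return dict(parse_fasta(aligned_a)) == dict(parse_fasta(aligned_b))
```

To use it, one would write `reverse_fasta(original_text)` to a second input file, run the same alignment program with the same settings on both files, and pass the two outputs to `same_alignment`. A result of `False` means the program produced a different alignment for the reversed input.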

SUBMITTER: Boyce K 

PROVIDER: S-EPMC4599319 | biostudies-literature | 2015

REPOSITORIES: biostudies-literature

Publications

Instability in progressive multiple sequence alignment algorithms.

Boyce K, Sievers F, Higgins DG

Algorithms for Molecular Biology: AMB, 2015-10-09


Similar Datasets

| S-EPMC6151001 | biostudies-literature
| S-EPMC2478692 | biostudies-literature
| S-EPMC2632924 | biostudies-literature
| S-EPMC3592395 | biostudies-literature
| S-EPMC3799466 | biostudies-literature
| S-EPMC2943993 | biostudies-literature
| S-EPMC1180752 | biostudies-literature
| S-EPMC1948021 | biostudies-literature
| S-EPMC8289385 | biostudies-literature
| S-EPMC6657586 | biostudies-literature