Unknown

Dataset Information

0

A fast algorithm for determining the best combination of local alignments to a query sequence.


ABSTRACT: Existing sequence alignment algorithms assume that similarities between DNA or amino acid sequences are linearly ordered. That is, stretches of similar nucleotides or amino acids are in the same order in both sequences. Recombination perturbs this order. An algorithm that can reconstruct sequence similarity despite rearrangement would be helpful for reconstructing the evolutionary history of recombined sequences.We propose a graph-based algorithm for combining multiple local alignments to a query sequence into the single combination of alignments that either covers the maximal portion of the query or results in the single highest alignment score to the query. This algorithm can help study the process of genome rearrangement, improve functional gene annotation, and reconstruct the evolutionary history of recombined proteins. The algorithm takes O(n2) time, where n is the number of local alignments considered.We discuss two example applications of the algorithm. The algorithm is able to provide useful reconstructions of the metazoan mitochondrial genome. It is also able to increase the percentage of a query sequence's amino acid residues for which similar stretches of amino acids can be found in sequence databases.

SUBMITTER: Conant GC 

PROVIDER: S-EPMC436051 | biostudies-literature | 2004 May

REPOSITORIES: biostudies-literature

altmetric image

Publications

A fast algorithm for determining the best combination of local alignments to a query sequence.

Conant Gavin C GC   Wagner Andreas A  

BMC bioinformatics 20040518


<h4>Background</h4>Existing sequence alignment algorithms assume that similarities between DNA or amino acid sequences are linearly ordered. That is, stretches of similar nucleotides or amino acids are in the same order in both sequences. Recombination perturbs this order. An algorithm that can reconstruct sequence similarity despite rearrangement would be helpful for reconstructing the evolutionary history of recombined sequences.<h4>Results</h4>We propose a graph-based algorithm for combining  ...[more]

Similar Datasets

| S-EPMC1208854 | biostudies-literature
| S-EPMC9710668 | biostudies-literature
| S-EPMC101229 | biostudies-literature
| S-EPMC3638164 | biostudies-other
| S-EPMC3261699 | biostudies-literature
| S-EPMC5569537 | biostudies-literature
| S-EPMC1945026 | biostudies-literature
| S-EPMC4748600 | biostudies-literature
| S-EPMC3999979 | biostudies-literature
| S-EPMC4123905 | biostudies-literature