
Dataset Information


A novel method for multiple alignment of sequences with repeated and shuffled elements.


ABSTRACT: We describe ABA (A-Bruijn alignment), a new method for multiple alignment of biological sequences. The major difference between ABA and existing multiple alignment methods is that ABA represents an alignment as a directed graph, possibly containing cycles. This representation provides more flexibility than does a traditional alignment matrix or the recently introduced partial order alignment (POA) graph by allowing a larger class of evolutionary relationships between the aligned sequences. Our graph representation is particularly well-suited to the alignment of protein sequences with shuffled and/or repeated domain structure, and allows one to construct multiple alignments of proteins containing (1) domains that are not present in all proteins, (2) domains that are present in different orders in different proteins, and (3) domains that are present in multiple copies in some proteins. In addition, ABA is useful in the alignment of genomic sequences that contain duplications and inversions. We provide several examples illustrating the applications of ABA.
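As a toy illustration of why a directed graph that may contain cycles is more flexible than an alignment matrix, the sketch below treats each protein as an ordered list of domain labels, glues identical labels into a single vertex, and adds a directed edge between consecutive domains. The labels `A`, `B`, `C`, the helpers `build_domain_graph` and `has_cycle`, and the glue-by-label rule are all assumptions made for illustration. This is not the ABA algorithm itself, which operates on sequences rather than on pre-labelled domains.

```python
from collections import defaultdict


def build_domain_graph(proteins):
    """Glue identical domain labels into one vertex and link consecutive domains.

    `proteins` maps a (hypothetical) protein name to its ordered domain labels.
    Returns a dict: vertex -> set of successor vertices.
    """
    edges = defaultdict(set)
    for domains in proteins.values():
        for label in domains:
            edges[label]  # register the vertex even if it has no outgoing edge
        for src, dst in zip(domains, domains[1:]):
            edges[src].add(dst)
    return dict(edges)


def has_cycle(edges):
    """Depth-first search with three colours; a back edge means a cycle."""
    WHITE, GRAY, BLACK = 0, 1, 2
    color = {v: WHITE for v in edges}

    def visit(v):
        color[v] = GRAY
        for w in edges[v]:
            if color[w] == GRAY:
                return True  # back edge (includes self-loops)
            if color[w] == WHITE and visit(w):
                return True
        color[v] = BLACK
        return False

    return any(color[v] == WHITE and visit(v) for v in edges)


# Case (1): a domain missing from one protein keeps the graph acyclic.
missing = build_domain_graph({"P1": ["A", "B", "C"], "P2": ["A", "C"]})
# Case (2): domains in different orders produce a cycle.
shuffled = build_domain_graph({"P1": ["A", "B", "C"], "P2": ["B", "A", "C"]})
# Case (3): a domain repeated in tandem produces a self-loop.
repeated = build_domain_graph({"P1": ["A", "B", "B", "C"]})

for name, graph in [("missing", missing), ("shuffled", shuffled), ("repeated", repeated)]:
    print(name, "has cycle:", has_cycle(graph))
```

In this toy model only the first case could be drawn as a partial order; the shuffled and repeated cases need cycles, which is the kind of relationship the abstract says a cyclic graph representation can accommodate.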

SUBMITTER: Raphael B 

PROVIDER: S-EPMC525693 | biostudies-literature | 2004 Nov

REPOSITORIES: biostudies-literature


Publications

A novel method for multiple alignment of sequences with repeated and shuffled elements.

Raphael B, Zhi D, Tang H, Pevzner P

Genome Research, 2004-11-01, issue 11



Similar Datasets

| S-EPMC6330207 | biostudies-literature
| S-EPMC8998981 | biostudies-literature
| S-EPMC546147 | biostudies-literature
| S-EPMC373279 | biostudies-literature
| S-EPMC10809904 | biostudies-literature
| S-EPMC9602327 | biostudies-literature
| S-EPMC2848238 | biostudies-literature
| S-EPMC4179140 | biostudies-literature
| S-EPMC1539025 | biostudies-literature
| S-EPMC10148686 | biostudies-literature