Unknown

Dataset Information

0

Improvement in accuracy of multiple sequence alignment using novel group-to-group sequence alignment algorithm with piecewise linear gap cost.


ABSTRACT:

Background

Multiple sequence alignment (MSA) is a useful tool in bioinformatics. Although many MSA algorithms have been developed, there is still room for improvement in accuracy and speed. In the alignment of a family of protein sequences, global MSA algorithms perform better than local ones in many cases, while local ones perform better than global ones when some sequences have long insertions or deletions (indels) relative to others. Many recent leading MSA algorithms have incorporated pairwise alignment information obtained from a mixture of sources into their scoring system to improve accuracy of alignment containing long indels.

Results

We propose a novel group-to-group sequence alignment algorithm that uses a piecewise linear gap cost. We developed a program called PRIME, which employs our proposed algorithm to optimize the well-defined sum-of-pairs score. PRIME stands for Profile-based Randomized Iteration MEthod. We evaluated PRIME and some recent MSA programs using BAliBASE version 3.0 and PREFAB version 4.0 benchmarks. The results of benchmark tests showed that PRIME can construct accurate alignments comparable to the most accurate programs currently available, including L-INS-i of MAFFT, ProbCons, and T-Coffee.

Conclusion

PRIME enables users to construct accurate alignments without having to employ pairwise alignment information. PRIME is available at http://prime.cbrc.jp/.

SUBMITTER: Yamada S 

PROVIDER: S-EPMC1769516 | biostudies-literature | 2006 Dec

REPOSITORIES: biostudies-literature

altmetric image

Publications

Improvement in accuracy of multiple sequence alignment using novel group-to-group sequence alignment algorithm with piecewise linear gap cost.

Yamada Shinsuke S   Gotoh Osamu O   Yamana Hayato H  

BMC bioinformatics 20061201


<h4>Background</h4>Multiple sequence alignment (MSA) is a useful tool in bioinformatics. Although many MSA algorithms have been developed, there is still room for improvement in accuracy and speed. In the alignment of a family of protein sequences, global MSA algorithms perform better than local ones in many cases, while local ones perform better than global ones when some sequences have long insertions or deletions (indels) relative to others. Many recent leading MSA algorithms have incorporate  ...[more]

Similar Datasets

| S-EPMC548345 | biostudies-literature
| S-EPMC8355039 | biostudies-literature
| S-EPMC5995191 | biostudies-literature
| S-EPMC101229 | biostudies-literature
| S-EPMC145823 | biostudies-other
| S-EPMC147093 | biostudies-other
| S-EPMC3638164 | biostudies-other
| S-EPMC6825651 | biostudies-literature
| S-EPMC2850363 | biostudies-literature
| S-EPMC2632924 | biostudies-literature