Dataset Information

MAFFT version 5: improvement in accuracy of multiple sequence alignment.


ABSTRACT: The accuracy of the multiple sequence alignment program MAFFT has been improved. The new version (5.3) of MAFFT offers new iterative refinement options, H-INS-i, F-INS-i and G-INS-i, in which pairwise alignment information is incorporated into the objective function. These new options of MAFFT showed higher accuracy than currently available methods, including TCoffee version 2 and CLUSTAL W, in benchmark tests consisting of alignments of >50 sequences. Like the previously available options, the new options of MAFFT can handle hundreds of sequences on a standard desktop computer. We also examined the effect of the number of homologues included in an alignment. For a multiple alignment consisting of approximately 8 sequences with low similarity, the accuracy was improved (2-10 percentage points) when the sequences were aligned together with dozens of their close homologues (E-value < 10^-5 to 10^-20) collected from a database. Such improvement was generally observed for most methods, but was remarkably large for the new options of MAFFT proposed here. Thus, we made a Ruby script, mafftE.rb, which aligns the input sequences together with their close homologues collected from SwissProt using NCBI-BLAST.
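
The homologue-collection step described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' mafftE.rb script. It assumes the BLAST hits are already available as standard 12-column tabular output (query ID in column 1, subject ID in column 2, E-value in column 11). The function name select_close_homologues and the parameters max_evalue and max_hits are hypothetical, chosen only for this illustration.

```python
def select_close_homologues(blast_tabular, max_evalue=1e-5, max_hits=50):
    """Pick subject IDs with an E-value below max_evalue, best hits first.

    Illustrative only: assumes 12-column tab-separated BLAST output.
    """
    best = {}  # subject ID -> lowest E-value seen for that subject
    for line in blast_tabular.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comment lines
        fields = line.split("\t")
        query_id, subject_id = fields[0], fields[1]
        evalue = float(fields[10])
        if subject_id == query_id:
            continue  # the query itself is already in the input set
        if evalue < max_evalue and evalue < best.get(subject_id, float("inf")):
            best[subject_id] = evalue
    # Rank by E-value (smallest first) and cap the number of homologues.
    ranked = sorted(best, key=best.get)
    return ranked[:max_hits]
```

The selected sequences would then be appended to the input sequences and the combined set passed to the aligner. The thresholds here are placeholders. The abstract reports cutoffs in the range 10^-5 to 10^-20.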

SUBMITTER: Katoh K 

PROVIDER: S-EPMC548345 | biostudies-literature | 2005

REPOSITORIES: biostudies-literature

Publications

MAFFT version 5: improvement in accuracy of multiple sequence alignment.

Kazutaka Katoh, Kei-ichi Kuma, Hiroyuki Toh, Takashi Miyata

Nucleic Acids Research, 2005-01-20, issue 2


Similar Datasets

S-EPMC3603318 | biostudies-literature
S-EPMC2905546 | biostudies-literature
S-EPMC4920119 | biostudies-literature
S-EPMC2387179 | biostudies-literature
S-EPMC1769516 | biostudies-literature
S-EPMC2632924 | biostudies-literature
S-EPMC8796358 | biostudies-literature
S-EPMC6041967 | biostudies-literature
S-EPMC280650 | biostudies-literature
S-EPMC3592395 | biostudies-literature