Dataset Information

Inferring phylogenies of evolving sequences without multiple sequence alignment.


ABSTRACT: Alignment-free methods, in which shared properties of sub-sequences (e.g. identity or match length) are extracted and used to compute a distance matrix, have recently been explored for phylogenetic inference. However, the scalability and robustness of these methods to key evolutionary processes remain to be investigated. Here, using simulated sequence sets of various sizes in both nucleotides and amino acids, we systematically assess the accuracy of phylogenetic inference using an alignment-free approach, based on D2 statistics, under different evolutionary scenarios. We find that compared to a multiple sequence alignment approach, D2 methods are more robust against among-site rate heterogeneity, compositional biases, genetic rearrangements and insertions/deletions, but are more sensitive to recent sequence divergence and sequence truncation. Across diverse empirical datasets, the alignment-free methods perform well for sequences sharing low divergence, at greater computation speed. Our findings provide strong evidence for the scalability and the potential use of alignment-free methods in large-scale phylogenomics.
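
The idea of turning shared sub-sequence properties into a distance matrix can be illustrated with the basic D2 statistic, which sums, over every word (k-mer) of length k, the product of that word's counts in two sequences. The following is a minimal sketch and not the authors' pipeline: the function names are hypothetical, and the conversion from D2 to a distance (negative log of a cosine-normalised D2) is an assumption chosen for illustration only.

```python
from collections import Counter
from itertools import combinations
import math


def kmer_counts(seq, k):
    """Count every overlapping k-mer (word of length k) in seq."""
    return Counter(seq[i:i + k] for i in range(len(seq) - k + 1))


def d2(counts_x, counts_y):
    """Basic D2 statistic: sum over shared words of the product of counts."""
    return sum(n * counts_y[w] for w, n in counts_x.items() if w in counts_y)


def d2_distance(counts_x, counts_y):
    """Illustrative distance (an assumption, not the paper's formula):
    negative log of D2 normalised by the self-scores of both sequences."""
    shared = d2(counts_x, counts_y)
    if shared == 0:
        return math.inf  # no k-mer in common
    norm = math.sqrt(d2(counts_x, counts_x) * d2(counts_y, counts_y))
    return max(0.0, -math.log(shared / norm))


def distance_matrix(seqs, k):
    """Pairwise distances for a dict mapping name -> sequence."""
    names = list(seqs)
    counts = {name: kmer_counts(seqs[name], k) for name in names}
    matrix = {a: {b: 0.0 for b in names} for a in names}
    for a, b in combinations(names, 2):
        matrix[a][b] = matrix[b][a] = d2_distance(counts[a], counts[b])
    return matrix


if __name__ == "__main__":
    # Toy sequences, invented for this example.
    toy = {"s1": "ACGTACGTAC", "s2": "ACGTACGAAC", "s3": "TTGCATTGCA"}
    for name, row in distance_matrix(toy, k=3).items():
        print(name, row)
```

A matrix of this kind would typically be passed to a distance-based tree-building method such as neighbour-joining. No multiple sequence alignment is computed at any step, which is the property the abstract evaluates.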

SUBMITTER: Chan CX 

PROVIDER: S-EPMC4179140 | biostudies-literature | 2014 Sep

REPOSITORIES: biostudies-literature

Publications

Inferring phylogenies of evolving sequences without multiple sequence alignment.

Chan CX, Bernard G, Poirion O, Hogan JM, Ragan MA

Scientific Reports, 2014-09-30

Similar Datasets

| S-EPMC3320897 | biostudies-literature
| S-EPMC8796358 | biostudies-literature
| S-EPMC1955456 | biostudies-literature
| S-EPMC3014950 | biostudies-literature
| S-EPMC4424971 | biostudies-literature
| S-EPMC10592837 | biostudies-literature
| S-EPMC7462517 | biostudies-literature
| S-EPMC3799466 | biostudies-literature
| S-EPMC546147 | biostudies-literature
| S-EPMC441520 | biostudies-literature