
Dataset Information


ASTRAL-II: coalescent-based species tree estimation with many hundreds of taxa and thousands of genes.


ABSTRACT: The estimation of species phylogenies requires multiple loci, since different loci can have different trees due to incomplete lineage sorting, modeled by the multi-species coalescent model. We recently developed a coalescent-based method, ASTRAL, which is statistically consistent under the multi-species coalescent model and which is more accurate than other coalescent-based methods on the datasets we examined. ASTRAL runs in polynomial time by constraining the search space using a set of allowed 'bipartitions'. Despite the limitation to allowed bipartitions, ASTRAL is statistically consistent. We present a new version of ASTRAL, which we call ASTRAL-II. We show that ASTRAL-II has substantial advantages over ASTRAL: it is faster, can analyze much larger datasets (up to 1000 species and 1000 genes) and has substantially better accuracy under some conditions. ASTRAL's running time is [Formula: see text], and ASTRAL-II's running time is [Formula: see text], where n is the number of species, k is the number of loci and X is the set of allowed bipartitions for the search space. ASTRAL-II is available in open source at https://github.com/smirarab/ASTRAL and datasets used are available at http://www.cs.utexas.edu/~phylo/datasets/astral2/. Contact: smirarab@gmail.com. Supplementary data are available at Bioinformatics online.
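
To make the notion of a set X of allowed bipartitions more concrete, the following is a minimal sketch, not ASTRAL's actual code. It assumes that gene trees are given as nested Python tuples of taxon labels (a hypothetical input format chosen for brevity) and that X is built by collecting every non-trivial bipartition found in the input gene trees, which is one natural way to define the constrained search space. The names `leaves`, `bipartitions` and `allowed_bipartitions` are illustrative only.

```python
def leaves(tree):
    """Return the set of taxon labels below a node (a leaf is a plain string)."""
    if isinstance(tree, str):
        return frozenset([tree])
    return frozenset().union(*(leaves(child) for child in tree))


def bipartitions(tree):
    """Return the non-trivial bipartitions induced by the edges of one tree.

    Each bipartition is stored as a frozenset holding its two sides, so that
    A|B and B|A compare equal.
    """
    taxa = leaves(tree)
    found = set()

    def visit(node):
        if isinstance(node, str):
            clade = frozenset([node])
        else:
            clade = frozenset().union(*(visit(child) for child in node))
        rest = taxa - clade
        # Skip trivial splits (a single leaf, or the whole taxon set).
        if len(clade) >= 2 and len(rest) >= 2:
            found.add(frozenset([clade, rest]))
        return clade

    visit(tree)
    return found


def allowed_bipartitions(gene_trees):
    """Assumed definition of X: the union of bipartitions over all gene trees."""
    X = set()
    for tree in gene_trees:
        X |= bipartitions(tree)
    return X


# Two hypothetical gene trees on taxa A..E that disagree near A, B and C.
gene_trees = [
    ((("A", "B"), "C"), ("D", "E")),
    ((("A", "C"), "B"), ("D", "E")),
]
X = allowed_bipartitions(gene_trees)
print(len(X))
```

In this toy input the two trees share the split ABC|DE and each contributes one split of its own, so X stays small. Restricting the search to such a set is what keeps the running time polynomial in n, k and the size of X.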

SUBMITTER: Mirarab S 

PROVIDER: S-EPMC4765870 | biostudies-other | 2015 Jun

REPOSITORIES: biostudies-other


Publications

ASTRAL-II: coalescent-based species tree estimation with many hundreds of taxa and thousands of genes.

Siavash Mirarab, Tandy Warnow

Bioinformatics (Oxford, England), 2015 Jun 1, issue 12



Similar Datasets

| S-EPMC4147915 | biostudies-literature
| S-EPMC5998899 | biostudies-literature
| S-EPMC4604832 | biostudies-literature
| S-EPMC4341064 | biostudies-literature
| S-EPMC6538240 | biostudies-literature
| S-EPMC7161100 | biostudies-literature
| S-EPMC5009668 | biostudies-literature
| S-EPMC4206468 | biostudies-literature
| S-EPMC3904522 | biostudies-literature
| S-EPMC2104643 | biostudies-literature