Unknown

Dataset Information

0

FASTRAL: Improving scalability of phylogenomic analysis.


ABSTRACT:

Motivation

ASTRAL is the current leading method for species tree estimation from phylogenomic datasets (i.e., hundreds to thousands of genes) that addresses gene tree discord resulting from incomplete lineage sorting (ILS). ASTRAL is statistically consistent under the multi-locus coalescent model (MSC), runs in polynomial time, and is able to run on large datasets. Key to ASTRAL's algorithm is the use of dynamic programming to find an optimal solution to the MQSST (maximum quartet support supertree) within a constraint space that it computes from the input. Yet, ASTRAL can fail to complete within reasonable timeframes on large datasets with many genes and species, because in these cases the constraint space it computes is too large.

Results

Here we introduce FASTRAL, a phylogenomic estimation method. FASTRAL is based on ASTRAL, but uses a different technique for constructing the constraint space. The technique we use to define the constraint space maintains statistical consistency and is polynomial time; thus we prove that FASTRAL is a polynomial time algorithm that is statistically consistent under the MSC. Our performance study on both biological and simulated data sets demonstrates that FASTRAL matches or improves on ASTRAL with respect to species tree topology accuracy (and under high ILS conditions it is statistically significantly more accurate), while being dramatically faster-especially on datasets with large numbers of genes and high ILS-due to using a significantly smaller constraint space.

Availability

FASTRAL is available in open-source form at https://github.com/PayamDiba/FASTRAL.

Supplementary information

Supplementary data are available at Bioinformatics online.

SUBMITTER: Dibaeinia P 

PROVIDER: S-EPMC8388037 | biostudies-literature |

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC8728279 | biostudies-literature
| S-EPMC4239591 | biostudies-literature
| S-EPMC6612878 | biostudies-other
| S-EPMC6945013 | biostudies-literature
| PRJEB18489 | ENA
| S-EPMC2904699 | biostudies-literature
| S-EPMC4868118 | biostudies-literature
| S-EPMC3531168 | biostudies-literature
| S-EPMC6047143 | biostudies-literature
| S-EPMC1810536 | biostudies-literature