Dataset Information

A general species delimitation method with applications to phylogenetic placements.


ABSTRACT: MOTIVATION: Sequence-based methods to delimit species are central to DNA taxonomy, microbial community surveys and DNA metabarcoding studies. Current approaches rely either on simple sequence similarity thresholds (OTU-picking) or on complex and compute-intensive evolutionary models. The OTU-picking methods scale well to large datasets, but the results are highly sensitive to the similarity threshold. Coalescent-based species delimitation approaches often rely on Bayesian statistics and Markov Chain Monte Carlo sampling, and can therefore only be applied to small datasets. RESULTS: We introduce the Poisson tree processes (PTP) model to infer putative species boundaries on a given phylogenetic input tree. We also integrate PTP with our evolutionary placement algorithm (EPA-PTP) to count the number of species in phylogenetic placements. We compare our approaches with popular OTU-picking methods and the General Mixed Yule Coalescent (GMYC) model. For de novo species delimitation, the stand-alone PTP model generally outperforms GMYC as well as OTU-picking methods when evolutionary distances between species are small. PTP requires neither an ultrametric input tree nor a sequence similarity threshold as input. In the open reference species delimitation approach, EPA-PTP yields more accurate results than de novo species delimitation methods. Finally, EPA-PTP scales to large datasets because it relies on the parallel implementations of the EPA and RAxML, which makes it possible to delimit species in high-throughput sequencing data. AVAILABILITY AND IMPLEMENTATION: The code is freely available at www.exelixis-lab.org/software.html.

SUBMITTER: Zhang J 

PROVIDER: S-EPMC3810850 | biostudies-other | 2013 Nov

REPOSITORIES: biostudies-other

Publications

A general species delimitation method with applications to phylogenetic placements.

Jiajie Zhang, Paschalia Kapli, Pavlos Pavlidis, Alexandros Stamatakis

Bioinformatics (Oxford, England), 2013-08-29, issue 22


Similar Datasets

| S-EPMC4374709 | biostudies-literature
| S-EPMC4841241 | biostudies-literature
| S-EPMC5648662 | biostudies-literature
| S-EPMC3503838 | biostudies-literature
| PRJEB56182 | ENA
| PRJEB50352 | ENA
| S-EPMC5920023 | biostudies-literature
| S-EPMC5207705 | biostudies-literature
| S-EPMC3728320 | biostudies-literature
| S-EPMC6512763 | biostudies-literature