Dataset Information

Phylogenic inference using alignment-free methods for applications in microbial community surveys using 16s rRNA gene.


ABSTRACT: The diversity of microbiota is best explored by understanding the phylogenetic structure of the microbial communities. Traditionally, sequence alignment has been used for phylogenetic inference. However, alignment-based approaches come with significant challenges and limitations when massive amounts of data are analyzed. In the past decade, alignment-free approaches have enabled genome-scale phylogenetic inference. Here we evaluate three alignment-free methods (ACS, CVTree, and Kr) for phylogenetic inference with 16S rRNA gene data. We use a taxonomic gold standard to compare the accuracy of alignment-free phylogenetic inference with that of common microbiome-wide phylogenetic inference pipelines based on PyNAST and MUSCLE alignments with FastTree and RAxML. We re-simulate fecal communities from Human Microbiome Project data to evaluate the performance of the methods on datasets with properties of real data. Our comparisons show that alignment-free methods are not inferior to alignment-based methods in giving accurate and robust phylogenetic trees. Moreover, consensus ensembles of alignment-free phylogenies are superior to those built from alignment-based methods in their ability to highlight community differences in low-power settings. In addition, the overall running times of alignment-based and alignment-free phylogenetic inference are comparable. Taken together, our empirical results suggest that alignment-free methods provide a viable approach for microbiome-wide phylogenetic inference.

SUBMITTER: Zhang Y 

PROVIDER: S-EPMC5685621 | biostudies-literature | 2017

REPOSITORIES: biostudies-literature

Publications

Phylogenic inference using alignment-free methods for applications in microbial community surveys using 16s rRNA gene.

Zhang Y, Alekseyenko AV

PLoS One, 14 November 2017, issue 11


Similar Datasets

S-EPMC6852867 | biostudies-literature
S-EPMC7455996 | biostudies-literature
S-EPMC3819121 | biostudies-literature
S-EPMC3864673 | biostudies-literature
S-EPMC5747434 | biostudies-literature
S-EPMC3001099 | biostudies-literature
S-EPMC5069956 | biostudies-literature