Dataset Information

Evaluating the accuracy of amplicon-based microbiome computational pipelines on simulated human gut microbial communities.

ABSTRACT:

Background

Microbiome studies commonly use 16S rRNA gene amplicon sequencing to characterize microbial communities. Errors introduced at multiple steps in this process can affect the interpretation of the data. Here we evaluate the accuracy of operational taxonomic unit (OTU) generation, taxonomic classification, alpha- and beta-diversity measures for different settings in QIIME, MOTHUR and a pplacer-based classification pipeline, using a novel software package: DECARD.

Results

In-silico we generated 100 synthetic bacterial communities approximating human stool microbiomes to be used as a gold-standard for evaluating the colligative performance of microbiome analysis software. Our synthetic data closely matched the composition and complexity of actual healthy human stool microbiomes. Genus-level taxonomic classification was correctly done for only 50.4-74.8% of the source organisms. Miscall rates varied from 11.9 to 23.5%. Species-level classification was less successful, (6.9-18.9% correct); miscall rates were comparable to those of genus-level targets (12.5-26.2%). The degree of miscall varied by clade of organism, pipeline and specific settings used. OTU generation accuracy varied by strategy (closed, de novo or subsampling), reference database, algorithm and software implementation. Shannon diversity estimation accuracy correlated generally with OTU-generation accuracy. Beta-diversity estimates with Double Principle Coordinate Analysis (DPCoA) were more robust against errors introduced in processing than Weighted UniFrac. The settings suggested in the tutorials were among the worst performing in all outcomes tested.

Conclusions

Even when using the same classification pipeline, the specific OTU-generation strategy, reference database and downstream analysis methods selection can have a dramatic effect on the accuracy of taxonomic classification, and alpha- and beta-diversity estimation. Even minor changes in settings adversely affected the accuracy of the results, bringing them far from the best-observed result. Thus, specific details of how a pipeline is used (including OTU generation strategy, reference sets, clustering algorithm and specific software implementation) should be specified in the methods section of all microbiome studies. Researchers should evaluate their chosen pipeline and settings to confirm it can adequately answer the research question rather than assuming the tutorial or standard-operating-procedure settings will be adequate or optimal.

SUBMITTER: Golob JL

PROVIDER: S-EPMC5450146 | biostudies-literature | 2017 May

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

Evaluating the accuracy of amplicon-based microbiome computational pipelines on simulated human gut microbial communities.

Golob Jonathan L JL Margolis Elisa E Hoffman Noah G NG Fredricks David N DN

BMC bioinformatics 20170530 1

<h4>Background</h4>Microbiome studies commonly use 16S rRNA gene amplicon sequencing to characterize microbial communities. Errors introduced at multiple steps in this process can affect the interpretation of the data. Here we evaluate the accuracy of operational taxonomic unit (OTU) generation, taxonomic classification, alpha- and beta-diversity measures for different settings in QIIME, MOTHUR and a pplacer-based classification pipeline, using a novel software package: DECARD.<h4>Results</h4>In ...[more]

PMID: 28558684

Dataset Information

Evaluating the accuracy of amplicon-based microbiome computational pipelines on simulated human gut microbial communities.

Background

Results

Conclusions

Publications

Evaluating the accuracy of amplicon-based microbiome computational pipelines on simulated human gut microbial communities.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

Comparing bioinformatic pipelines for microbial 16S rRNA amplicon sequencing.
| S-EPMC6964864 | biostudies-literature

Deciphering microbial interactions in synthetic human gut microbiome communities.
| S-EPMC6011841 | biostudies-literature

Characterization of microbial communities in gas industry pipelines.
| S-EPMC194955 | biostudies-literature

Effect of certain microorganisms on mouse gut microbial communities.
2021-09-24 | GSE180087 | GEO

Microbial communities in the gut of the beetle Odontotaenius disjunctus
2012-08-13 | E-GEOD-40067 | biostudies-arrayexpress

Microbial communities in the gut of the beetle Odontotaenius disjunctus
2012-08-14 | GSE40067 | GEO

The dynamics of gut-associated microbial communities during inflammation.
| S-EPMC3615657 | biostudies-other

Microbial Communities in Simulated Lanfills
| PRJEB23759 | ENA

Evaluating microbial communities through metagenomes
| PRJEB17887 | ENA

Meta-Apo improves accuracy of 16S-amplicon-based prediction of microbiome function.
| S-EPMC7788972 | biostudies-literature