Dataset Information

Analysis of sequencing strategies and tools for taxonomic annotation: Defining standards for progressive metagenomics.


ABSTRACT: Metagenomics research has recently thrived due to improvements in DNA sequencing technologies, which have driven the emergence of new analysis tools and the growth of taxonomic databases. However, no all-purpose strategy can guarantee the best result for a given project, and many combinations of software, parameters and databases can be tested. Therefore, we performed an impartial comparison, using statistical measures of classification for eight bioinformatic tools and four taxonomic databases, and defined a benchmark framework to evaluate each tool in a standardized context. Using in silico simulated data for 16S rRNA amplicons and whole metagenome shotgun data, we compared the results from different software and database combinations to detect biases related to algorithms or database annotation. Using our benchmark framework, researchers can define cut-off values to evaluate the expected error rate and coverage for their results, regardless of the score used by each software. A quick guide to select the best tool, along with all datasets and scripts needed to reproduce our results and benchmark any new method, is available at https://github.com/Ales-ibt/Metagenomic-benchmark. Finally, we stress the importance of gold standards, database curation and manual inspection of taxonomic profiling results for a more accurate description of microbial diversity.
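The cut-off idea mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' framework or scripts (those are in the repository linked above). It assumes that each simulated read has a classifier score and a known true label, that "coverage" means the fraction of all reads retained at a given cut-off, and that "error rate" means the fraction of retained reads that are misclassified. The function name `evaluate_cutoffs` and the toy scores are hypothetical.

```python
from typing import List, Tuple


def evaluate_cutoffs(
    assignments: List[Tuple[float, bool]],
    cutoffs: List[float],
) -> List[Tuple[float, float, float]]:
    """Return (cutoff, coverage, error_rate) for each cut-off.

    assignments: one (score, is_correct) pair per simulated read.
    coverage: fraction of all reads whose score is >= cutoff (assumed definition).
    error_rate: fraction of retained reads that are misclassified (assumed definition).
    """
    total = len(assignments)
    results = []
    for cutoff in cutoffs:
        # Keep only the correctness flags of reads that pass the cut-off.
        kept = [ok for score, ok in assignments if score >= cutoff]
        coverage = len(kept) / total if total else 0.0
        error_rate = (len(kept) - sum(kept)) / len(kept) if kept else 0.0
        results.append((cutoff, coverage, error_rate))
    return results


# Made-up scores for eight simulated reads (illustrative only, not real data).
toy = [(0.95, True), (0.90, True), (0.85, False), (0.80, True),
       (0.60, True), (0.55, False), (0.40, False), (0.30, True)]
table = evaluate_cutoffs(toy, [0.5, 0.8])
for cutoff, coverage, error_rate in table:
    print(f"cutoff={cutoff:.2f} coverage={coverage:.2f} error_rate={error_rate:.2f}")
```

Because the table is computed from correctness on simulated data rather than from the meaning of the score itself, the same procedure could in principle be applied to any tool's score, which is the sense in which a cut-off can be chosen "regardless of the score used by each software".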

SUBMITTER: Escobar-Zepeda A 

PROVIDER: S-EPMC6089906 | biostudies-literature | 2018 Aug

REPOSITORIES: biostudies-literature

Publications

Analysis of sequencing strategies and tools for taxonomic annotation: Defining standards for progressive metagenomics.

Escobar-Zepeda Alejandra, Godoy-Lozano Elizabeth Ernestina, Raggi Luciana, Segovia Lorenzo, Merino Enrique, Gutiérrez-Rios Rosa María, Juarez Katy, Licea-Navarro Alexei F, Pardo-Lopez Liliana, Sanchez-Flores Alejandro

Scientific Reports, 2018-08-13, issue 1


Similar Datasets

| S-EPMC6716367 | biostudies-literature
| S-EPMC2816706 | biostudies-literature
| S-EPMC4833860 | biostudies-other
| S-EPMC4551696 | biostudies-literature
| S-EPMC5557516 | biostudies-literature
2022-09-22 | GSE183942 | GEO
| S-EPMC3018814 | biostudies-other
| S-EPMC7671387 | biostudies-literature
| S-EPMC7161568 | biostudies-literature
| S-EPMC9028594 | biostudies-literature