Unknown

Dataset Information

0

WEVOTE: Weighted Voting Taxonomic Identification Method of Microbial Sequences.


ABSTRACT:

Background

Metagenome shotgun sequencing presents opportunities to identify organisms that may prevent or promote disease. The analysis of sample diversity is achieved by taxonomic identification of metagenomic reads followed by generating an abundance profile. Numerous tools have been developed based on different design principles. Tools achieving high precision can lack sensitivity in some applications. Conversely, tools with high sensitivity can suffer from low precision and require long computation time.

Methods

In this paper, we present WEVOTE (WEighted VOting Taxonomic idEntification), a method that classifies metagenome shotgun sequencing DNA reads based on an ensemble of existing methods using k-mer-based, marker-based, and naive-similarity based approaches. Our evaluation on fourteen benchmarking datasets shows that WEVOTE improves the classification precision by reducing false positive annotations while preserving a high level of sensitivity.

Conclusions

WEVOTE is an efficient and automated tool that combines multiple individual taxonomic identification methods to produce more precise and sensitive microbial profiles. WEVOTE is developed primarily to identify reads generated by MetaGenome Shotgun sequencing. It is expandable and has the potential to incorporate additional tools to produce a more accurate taxonomic profile. WEVOTE was implemented using C++ and shell scripting and is available at www.github.com/aametwally/WEVOTE.

SUBMITTER: Metwally AA 

PROVIDER: S-EPMC5040256 | biostudies-literature | 2016

REPOSITORIES: biostudies-literature

altmetric image

Publications

WEVOTE: Weighted Voting Taxonomic Identification Method of Microbial Sequences.

Metwally Ahmed A AA   Dai Yang Y   Finn Patricia W PW   Perkins David L DL  

PloS one 20160928 9


<h4>Background</h4>Metagenome shotgun sequencing presents opportunities to identify organisms that may prevent or promote disease. The analysis of sample diversity is achieved by taxonomic identification of metagenomic reads followed by generating an abundance profile. Numerous tools have been developed based on different design principles. Tools achieving high precision can lack sensitivity in some applications. Conversely, tools with high sensitivity can suffer from low precision and require l  ...[more]

Similar Datasets

| S-EPMC7960862 | biostudies-literature
| S-EPMC8320732 | biostudies-literature
| S-EPMC2957682 | biostudies-literature
| S-EPMC6805573 | biostudies-literature
| PRJEB77351 | ENA
| S-EPMC2894824 | biostudies-other
| S-EPMC3224148 | biostudies-literature
| S-EPMC4126192 | biostudies-literature
| S-EPMC9805571 | biostudies-literature
| S-EPMC2275094 | biostudies-literature