Unknown

Dataset Information

0

Fast and sensitive taxonomic assignment to metagenomic contigs.


ABSTRACT:

Summary

MMseqs2 taxonomy is a new tool to assign taxonomic labels to metagenomic contigs. It extracts all possible protein fragments from each contig, quickly retains those that can contribute to taxonomic annotation, assigns them with robust labels and determines the contig's taxonomic identity by weighted voting. Its fragment extraction step is suitable for the analysis of all domains of life. MMseqs2 taxonomy is 2-18× faster than state-of-the-art tools and also contains new modules for creating and manipulating taxonomic reference databases as well as reporting and visualizing taxonomic assignments.

Availability and implementation

MMseqs2 taxonomy is part of the MMseqs2 free open-source software package available for Linux, macOS and Windows at https://mmseqs.com.

Supplementary information

Supplementary data are available at Bioinformatics online.

SUBMITTER: Mirdita M 

PROVIDER: S-EPMC8479651 | biostudies-literature | 2021 Sep

REPOSITORIES: biostudies-literature

altmetric image

Publications

Fast and sensitive taxonomic assignment to metagenomic contigs.

Mirdita M M   Steinegger M M   Breitwieser F F   Söding J J   Levy Karin E E  

Bioinformatics (Oxford, England) 20210901 18


<h4>Summary</h4>MMseqs2 taxonomy is a new tool to assign taxonomic labels to metagenomic contigs. It extracts all possible protein fragments from each contig, quickly retains those that can contribute to taxonomic annotation, assigns them with robust labels and determines the contig's taxonomic identity by weighted voting. Its fragment extraction step is suitable for the analysis of all domains of life. MMseqs2 taxonomy is 2-18× faster than state-of-the-art tools and also contains new modules fo  ...[more]

Similar Datasets

2013-11-22 | GSE47690 | GEO
| S-EPMC3462201 | biostudies-literature
| PRJEB19201 | ENA
| S-EPMC3319535 | biostudies-literature
| S-EPMC4833860 | biostudies-other
| S-EPMC11437175 | biostudies-literature
2013-11-22 | E-GEOD-47690 | biostudies-arrayexpress
| S-EPMC4380030 | biostudies-literature
| S-EPMC8921650 | biostudies-literature
| S-EPMC8908641 | biostudies-literature