Dataset Information

CAMITAX: Taxon labels for microbial genomes.


ABSTRACT: BACKGROUND: The number of microbial genome sequences is increasing exponentially, especially thanks to recent advances in recovering complete or near-complete genomes from metagenomes and single cells. Assigning reliable taxon labels to genomes is key and often a prerequisite for downstream analyses. FINDINGS: We introduce CAMITAX, a scalable and reproducible workflow for the taxonomic labelling of microbial genomes recovered from isolates, single cells, and metagenomes. CAMITAX combines genome distance-, 16S ribosomal RNA gene-, and gene homology-based taxonomic assignments with phylogenetic placement. It uses Nextflow to orchestrate reference databases and software containers and thus combines ease of installation and use with computational reproducibility. We evaluated the method on several hundred metagenome-assembled genomes with high-quality taxonomic annotations from the TARA Oceans project, and we show that the ensemble classification method in CAMITAX improved on all individual methods across tested ranks. CONCLUSIONS: While we initially developed CAMITAX to aid the Critical Assessment of Metagenome Interpretation (CAMI) initiative, it evolved into a comprehensive software package to reliably assign taxon labels to microbial genomes. CAMITAX is available under Apache License 2.0 at https://github.com/CAMI-challenge/CAMITAX.
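The idea of combining several per-method taxonomic assignments into one label can be illustrated with a small sketch. This is not CAMITAX's implementation: it assumes each method reports a root-to-leaf lineage as a list of taxon names and keeps only the prefix that all methods agree on, a simple lowest-common-ancestor style consensus. The function name `consensus_lineage` and the example lineages are hypothetical.

```python
from typing import List, Sequence


def consensus_lineage(lineages: Sequence[Sequence[str]]) -> List[str]:
    """Return the longest root-to-leaf prefix shared by all non-empty lineages.

    Assumption: each lineage is ordered from the highest rank (e.g. domain)
    down to the most specific rank the method could assign.
    """
    # Methods that produced no assignment are skipped rather than
    # forcing the consensus to be empty.
    non_empty = [lineage for lineage in lineages if lineage]
    if not non_empty:
        return []

    consensus: List[str] = []
    # zip stops at the shortest lineage, so the consensus can never be
    # more specific than the least specific method.
    for names_at_rank in zip(*non_empty):
        if all(name == names_at_rank[0] for name in names_at_rank):
            consensus.append(names_at_rank[0])
        else:
            break
    return consensus


# Hypothetical assignments from three independent methods.
genome_distance = ["Bacteria", "Proteobacteria", "Gammaproteobacteria", "Enterobacterales"]
rrna_16s = ["Bacteria", "Proteobacteria", "Gammaproteobacteria"]
gene_homology = ["Bacteria", "Proteobacteria", "Alphaproteobacteria"]

label = consensus_lineage([genome_distance, rrna_16s, gene_homology])
print(label)  # ['Bacteria', 'Proteobacteria']
```

A strict shared-prefix rule is conservative: one disagreeing method is enough to stop the label at a higher rank. A real workflow may weight methods differently or use a reference taxonomy to resolve conflicts.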

SUBMITTER: Bremges A 

PROVIDER: S-EPMC6946028 | biostudies-literature | 2020 Jan

REPOSITORIES: biostudies-literature

Publications

CAMITAX: Taxon labels for microbial genomes.

Bremges A, Fritz A, McHardy AC

GigaScience, 2020-01-01, issue 1


Similar Datasets

| S-EPMC5270559 | biostudies-literature
| S-EPMC3260501 | biostudies-literature
| S-EPMC3370833 | biostudies-other
| S-EPMC6034075 | biostudies-literature
| S-EPMC3914688 | biostudies-literature
| S-EPMC2820454 | biostudies-literature
| S-EPMC2864248 | biostudies-literature
| S-EPMC3409199 | biostudies-literature
| S-EPMC186635 | biostudies-literature
| S-EPMC4673824 | biostudies-other