Dataset Information

DeepMicrobes: taxonomic classification for metagenomics with deep learning.


ABSTRACT: Large-scale metagenomic assemblies have uncovered thousands of new species, greatly expanding the known diversity of microbiomes in specific habitats. To investigate the roles of these uncultured species in human health or the environment, researchers need to incorporate their genome assemblies into a reference database for taxonomic classification. However, this procedure is hindered by the lack of a well-curated taxonomic tree for newly discovered species, which current metagenomics tools require. Here we report DeepMicrobes, a deep learning-based computational framework for taxonomic classification that allows researchers to bypass this limitation. We show that DeepMicrobes outperforms state-of-the-art tools in species and genus identification and achieves comparable accuracy in abundance estimation. We trained DeepMicrobes on genomes reconstructed from gut microbiomes and discovered potential novel signatures in inflammatory bowel diseases. DeepMicrobes facilitates effective investigations into the uncharacterized roles of metagenomic species.

SUBMITTER: Liang Q 

PROVIDER: S-EPMC7671387 | biostudies-literature | 2020 Mar

REPOSITORIES: biostudies-literature

Publications

DeepMicrobes: taxonomic classification for metagenomics with deep learning.

Liang Q, Bible PW, Liu Y, Zou B, Wei L

NAR Genomics and Bioinformatics, 2020-02-19 (1)


Similar Datasets

| S-EPMC6716367 | biostudies-literature
| S-EPMC4833860 | biostudies-other
| S-EPMC6069770 | biostudies-literature
| S-EPMC7255349 | biostudies-literature
| S-EPMC4896366 | biostudies-literature
| S-EPMC4309676 | biostudies-literature
| S-EPMC7498351 | biostudies-literature
2022-12-22 | GSE218466 | GEO
| S-EPMC8634433 | biostudies-literature