Unknown

Dataset Information

0

Metagenomic profiling of known and unknown microbes with microbeGPS.


ABSTRACT: Microbial community profiling identifies and quantifies organisms in metagenomic sequencing data using either reference based or unsupervised approaches. However, current reference based profiling methods only report the presence and abundance of single reference genomes that are available in databases. Since only a small fraction of environmental genomes is represented in genomic databases, these approaches entail the risk of false identifications and often suggest a higher precision than justified by the data. Therefore, we developed MicrobeGPS, a novel metagenomic profiling approach that overcomes these limitations. MicrobeGPS is the first method that identifies microbiota in the sample and estimates their genomic distances to known reference genomes. With this strategy, MicrobeGPS identifies organisms down to the strain level and highlights possibly inaccurate identifications when the correct reference genome is missing. We demonstrate on three metagenomic datasets with different origin that our approach successfully avoids misleading interpretation of results and additionally provides more accurate results than current profiling methods. Our results indicate that MicrobeGPS can enable reference based taxonomic profiling of complex and less characterized microbial communities. MicrobeGPS is open source and available from https://sourceforge.net/projects/microbegps/ as source code and binary distribution for Windows and Linux operating systems.

SUBMITTER: Lindner MS 

PROVIDER: S-EPMC4314203 | biostudies-literature | 2015

REPOSITORIES: biostudies-literature

altmetric image

Publications

Metagenomic profiling of known and unknown microbes with microbeGPS.

Lindner Martin S MS   Renard Bernhard Y BY  

PloS one 20150202 2


Microbial community profiling identifies and quantifies organisms in metagenomic sequencing data using either reference based or unsupervised approaches. However, current reference based profiling methods only report the presence and abundance of single reference genomes that are available in databases. Since only a small fraction of environmental genomes is represented in genomic databases, these approaches entail the risk of false identifications and often suggest a higher precision than justi  ...[more]

Similar Datasets

| S-EPMC5809485 | biostudies-literature
| S-EPMC8352191 | biostudies-literature
| S-EPMC4834863 | biostudies-literature
| S-EPMC10381864 | biostudies-literature
| S-EPMC4959991 | biostudies-other
| S-EPMC9132574 | biostudies-literature
| S-EPMC8051032 | biostudies-literature
| S-EPMC6445133 | biostudies-literature
| S-EPMC6170764 | biostudies-literature
| S-EPMC6028493 | biostudies-literature