Unknown

Dataset Information

0

ImGLAD: accurate detection and quantification of target organisms in metagenomes.


ABSTRACT: Accurate detection of target microbial species in metagenomic datasets from environmental samples remains limited because the limit of detection of current methods is typically inaccessible and the frequency of false-positives, resulting from inadequate identification of regions of the genome that are either too highly conserved to be diagnostic (e.g., rRNA genes) or prone to frequent horizontal genetic exchange (e.g., mobile elements) remains unknown. To overcome these limitations, we introduce imGLAD, which aims to detect (target) genomic sequences in metagenomic datasets. imGLAD achieves high accuracy because it uses the sequence-discrete population concept for discriminating between metagenomic reads originating from the target organism compared to reads from co-occurring close relatives, masks regions of the genome that are not informative using the MyTaxa engine, and models both the sequencing breadth and depth to determine relative abundance and limit of detection. We validated imGLAD by analyzing metagenomic datasets derived from spinach leaves inoculated with the enteric pathogen Escherichia coli O157:H7 and showed that its limit of detection can be comparable to that of PCR-based approaches for these samples (?1 cell/gram).

SUBMITTER: Castro JC 

PROVIDER: S-EPMC6216955 | biostudies-literature | 2018

REPOSITORIES: biostudies-literature

altmetric image

Publications

imGLAD: accurate detection and quantification of target organisms in metagenomes.

Castro Juan C JC   Rodriguez-R Luis M LM   Harvey William T WT   Weigand Michael R MR   Hatt Janet K JK   Carter Michelle Q MQ   Konstantinidis Konstantinos T KT  

PeerJ 20181102


Accurate detection of target microbial species in metagenomic datasets from environmental samples remains limited because the limit of detection of current methods is typically inaccessible and the frequency of false-positives, resulting from inadequate identification of regions of the genome that are either too highly conserved to be diagnostic (e.g., rRNA genes) or prone to frequent horizontal genetic exchange (e.g., mobile elements) remains unknown. To overcome these limitations, we introduce  ...[more]

Similar Datasets

| S-EPMC7111523 | biostudies-literature
| S-EPMC5388429 | biostudies-literature
| S-EPMC4562114 | biostudies-literature
| S-EPMC2656497 | biostudies-literature
| S-EPMC9808630 | biostudies-literature
| S-EPMC4643905 | biostudies-literature
| S-EPMC4069399 | biostudies-literature
| S-EPMC4632058 | biostudies-literature
| S-EPMC4816158 | biostudies-literature
| S-EPMC2322970 | biostudies-literature