Unknown

Dataset Information

0

DRISEE overestimates errors in metagenomic sequencing data.


ABSTRACT: The extremely high error rates reported by Keegan et al. in 'A platform-independent method for detecting errors in metagenomic sequencing data: DRISEE' (PLoS Comput Biol 2012; 8: :e1002541) for many next-generation sequencing datasets prompted us to re-examine their results. Our analysis reveals that the presence of conserved artificial sequences, e.g. Illumina adapters, and other naturally occurring sequence motifs accounts for most of the reported errors. We conclude that DRISEE reports inflated levels of sequencing error, particularly for Illumina data. Tools offered for evaluating large datasets need scrupulous review before they are implemented.

SUBMITTER: Eren AM 

PROVIDER: S-EPMC4171678 | biostudies-literature | 2014 Sep

REPOSITORIES: biostudies-literature

altmetric image

Publications

DRISEE overestimates errors in metagenomic sequencing data.

Eren A Murat AM   Morrison Hilary G HG   Huse Susan M SM   Sogin Mitchell L ML  

Briefings in bioinformatics 20130522 5


The extremely high error rates reported by Keegan et al. in 'A platform-independent method for detecting errors in metagenomic sequencing data: DRISEE' (PLoS Comput Biol 2012; 8: :e1002541) for many next-generation sequencing datasets prompted us to re-examine their results. Our analysis reveals that the presence of conserved artificial sequences, e.g. Illumina adapters, and other naturally occurring sequence motifs accounts for most of the reported errors. We conclude that DRISEE reports inflat  ...[more]

Similar Datasets

| S-EPMC3369934 | biostudies-literature
| S-EPMC2781827 | biostudies-literature
| S-EPMC4719071 | biostudies-literature
| S-EPMC3852801 | biostudies-literature
| S-EPMC5654362 | biostudies-literature
| S-EPMC4240813 | biostudies-literature
| S-EPMC7829059 | biostudies-literature
| PRJEB55463 | ENA
| S-EPMC5666806 | biostudies-literature
| S-EPMC3285633 | biostudies-literature