ABSTRACT:
SUBMITTER: Sameith K
PROVIDER: S-EPMC5221426 | biostudies-literature | 2017 Jan
REPOSITORIES: biostudies-literature
Sameith K, Roscito JG, Hiller M
Briefings in Bioinformatics, 2016-02-10, issue 1
Next-generation sequencers such as Illumina can now produce reads up to 300 bp with high throughput, which is attractive for genome assembly. A first step in genome assembly is to computationally correct sequencing errors. However, correcting all errors in these longer reads is challenging. Here, we show that reads with remaining errors after correction often overlap repeats, where short erroneous k-mers occur in other copies of the repeat. We developed an iterative error correction pipeline tha ...
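The repeat problem described in the abstract can be illustrated with a toy k-mer spectrum. The sketch below is not the authors' pipeline: the function names (`kmers`, `count_kmers`, `weak_positions`), the toy reads, and the values of `k` and `min_count` are all assumptions chosen for illustration. It flags k-mers seen fewer than `min_count` times as likely errors, then shows how the same erroneous k-mers stop being flagged once another repeat copy contributes them.

```python
from collections import Counter


def kmers(seq, k):
    """Yield every k-mer (substring of length k) of seq, left to right."""
    for i in range(len(seq) - k + 1):
        yield seq[i:i + k]


def count_kmers(reads, k):
    """Count how often each k-mer occurs across all reads."""
    counts = Counter()
    for read in reads:
        counts.update(kmers(read, k))
    return counts


def weak_positions(read, counts, k, min_count):
    """Return start positions of k-mers in read seen fewer than min_count times."""
    return [i for i, km in enumerate(kmers(read, k)) if counts[km] < min_count]


K = 4          # hypothetical k-mer size, far shorter than real settings
MIN_COUNT = 2  # hypothetical threshold separating "solid" from "weak" k-mers

# Case 1: five correct reads plus one read with a single substitution (A -> T).
unique_region = ["ACGTAC"] * 5 + ["ACGTTC"]
unique_counts = count_kmers(unique_region, K)
flagged_unique = weak_positions("ACGTTC", unique_counts, K, MIN_COUNT)

# Case 2: a second repeat copy genuinely contains the sequence ACGTTC,
# so the erroneous k-mers are well supported and no longer stand out.
with_repeat = unique_region + ["ACGTTC"] * 5
repeat_counts = count_kmers(with_repeat, K)
flagged_repeat = weak_positions("ACGTTC", repeat_counts, K, MIN_COUNT)
```

In the first case the two k-mers covering the substitution are flagged; in the second case nothing is flagged, because a short k-mer cannot distinguish the error from the other repeat copy. A larger `k` that spans a base unique to one copy would separate the two again, which is one plausible motivation for correcting iteratively with different k-mer sizes.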