Unknown

Dataset Information

0

De novo sequencing and variant calling with nanopores using PoreSeq.


ABSTRACT: The accuracy of sequencing single DNA molecules with nanopores is continually improving, but de novo genome sequencing and assembly using only nanopore data remain challenging. Here we describe PoreSeq, an algorithm that identifies and corrects errors in nanopore sequencing data and improves the accuracy of de novo genome assembly with increasing coverage depth. The approach relies on modeling the possible sources of uncertainty that occur as DNA transits through the nanopore and finds the sequence that best explains multiple reads of the same region. PoreSeq increases nanopore sequencing read accuracy of M13 bacteriophage DNA from 85% to 99% at 100× coverage. We also use the algorithm to assemble Escherichia coli with 30× coverage and the ? genome at a range of coverages from 3× to 50×. Additionally, we classify sequence variants at an order of magnitude lower coverage than is possible with existing methods.

SUBMITTER: Szalay T 

PROVIDER: S-EPMC4877053 | biostudies-literature | 2015 Oct

REPOSITORIES: biostudies-literature

altmetric image

Publications

De novo sequencing and variant calling with nanopores using PoreSeq.

Szalay Tamas T   Golovchenko Jene A JA  

Nature biotechnology 20150909 10


The accuracy of sequencing single DNA molecules with nanopores is continually improving, but de novo genome sequencing and assembly using only nanopore data remain challenging. Here we describe PoreSeq, an algorithm that identifies and corrects errors in nanopore sequencing data and improves the accuracy of de novo genome assembly with increasing coverage depth. The approach relies on modeling the possible sources of uncertainty that occur as DNA transits through the nanopore and finds the seque  ...[more]

Similar Datasets

| S-EPMC10777354 | biostudies-literature
| S-EPMC9900919 | biostudies-literature
| S-EPMC8545295 | biostudies-literature
| S-EPMC9771978 | biostudies-literature
| S-EPMC6123722 | biostudies-literature
| S-EPMC6449753 | biostudies-literature
| S-EPMC6210158 | biostudies-literature
| S-EPMC5937174 | biostudies-literature
| S-EPMC11231365 | biostudies-literature
| S-EPMC8141913 | biostudies-literature