Unknown

Dataset Information

0

Scaffolding and completing genome assemblies in real-time with nanopore sequencing.


ABSTRACT: Third generation sequencing technologies provide the opportunity to improve genome assemblies by generating long reads spanning most repeat sequences. However, current analysis methods require substantial amounts of sequence data and computational resources to overcome the high error rates. Furthermore, they can only perform analysis after sequencing has completed, resulting in either over-sequencing, or in a low quality assembly due to under-sequencing. Here we present npScarf, which can scaffold and complete short read assemblies while the long read sequencing run is in progress. It reports assembly metrics in real-time so the sequencing run can be terminated once an assembly of sufficient quality is obtained. In assembling four bacterial and one eukaryotic genomes, we show that npScarf can construct more complete and accurate assemblies while requiring less sequencing data and computational resources than existing methods. Our approach offers a time- and resource-effective strategy for completing short read assemblies.

SUBMITTER: Cao MD 

PROVIDER: S-EPMC5321748 | biostudies-literature | 2017 Feb

REPOSITORIES: biostudies-literature

altmetric image

Publications

Scaffolding and completing genome assemblies in real-time with nanopore sequencing.

Cao Minh Duc MD   Nguyen Son Hoang SH   Ganesamoorthy Devika D   Elliott Alysha G AG   Cooper Matthew A MA   Coin Lachlan J M LJ  

Nature communications 20170220


Third generation sequencing technologies provide the opportunity to improve genome assemblies by generating long reads spanning most repeat sequences. However, current analysis methods require substantial amounts of sequence data and computational resources to overcome the high error rates. Furthermore, they can only perform analysis after sequencing has completed, resulting in either over-sequencing, or in a low quality assembly due to under-sequencing. Here we present npScarf, which can scaffo  ...[more]

Similar Datasets

| S-EPMC5695209 | biostudies-literature
| S-EPMC4493687 | biostudies-literature
| S-EPMC5008457 | biostudies-literature
| S-EPMC4348652 | biostudies-literature
| S-EPMC6169393 | biostudies-literature
| S-EPMC4702336 | biostudies-literature
| S-EPMC6161345 | biostudies-literature
| S-EPMC5731603 | biostudies-literature
| S-EPMC7768660 | biostudies-literature
| S-EPMC5566789 | biostudies-literature