Unknown

Dataset Information

0

Single molecule sequencing-guided scaffolding and correction of draft assemblies.


ABSTRACT: Although single molecule sequencing is still improving, the lengths of the generated sequences are inevitably an advantage in genome assembly. Prior work that utilizes long reads to conduct genome assembly has mostly focused on correcting sequencing errors and improving contiguity of de novo assemblies.We propose a disassembling-reassembling approach for both correcting structural errors in the draft assembly and scaffolding a target assembly based on error-corrected single molecule sequences. To achieve this goal, we formulate a maximum alternating path cover problem. We prove that this problem is NP-hard, and solve it by a 2-approximation algorithm.Our experimental results show that our approach can improve the structural correctness of target assemblies in the cost of some contiguity, even with smaller amounts of long reads. In addition, our reassembling process can also serve as a competitive scaffolder relative to well-established assembly benchmarks.

SUBMITTER: Zhu S 

PROVIDER: S-EPMC5731603 | biostudies-literature | 2017 Dec

REPOSITORIES: biostudies-literature

altmetric image

Publications

Single molecule sequencing-guided scaffolding and correction of draft assemblies.

Zhu Shenglong S   Chen Danny Z DZ   Emrich Scott J SJ  

BMC genomics 20171206 Suppl 10


<h4>Background</h4>Although single molecule sequencing is still improving, the lengths of the generated sequences are inevitably an advantage in genome assembly. Prior work that utilizes long reads to conduct genome assembly has mostly focused on correcting sequencing errors and improving contiguity of de novo assemblies.<h4>Results</h4>We propose a disassembling-reassembling approach for both correcting structural errors in the draft assembly and scaffolding a target assembly based on error-cor  ...[more]

Similar Datasets

| S-EPMC6816165 | biostudies-literature
| S-EPMC5321748 | biostudies-literature
| S-EPMC3707490 | biostudies-literature
| S-EPMC5299676 | biostudies-literature
| S-EPMC6078200 | biostudies-literature
| S-EPMC4262078 | biostudies-literature
| S-EPMC7242662 | biostudies-literature
| S-EPMC4895481 | biostudies-literature
| S-EPMC5240625 | biostudies-literature
| S-EPMC6851084 | biostudies-literature