Unknown

Dataset Information

0

Fast and accurate de novo genome assembly from long uncorrected reads.


ABSTRACT: The assembly of long reads from Pacific Biosciences and Oxford Nanopore Technologies typically requires resource-intensive error-correction and consensus-generation steps to obtain high-quality assemblies. We show that the error-correction step can be omitted and that high-quality consensus sequences can be generated efficiently with a SIMD-accelerated, partial-order alignment-based, stand-alone consensus module called Racon. Based on tests with PacBio and Oxford Nanopore data sets, we show that Racon coupled with miniasm enables consensus genomes with similar or better quality than state-of-the-art methods while being an order of magnitude faster.

SUBMITTER: Vaser R 

PROVIDER: S-EPMC5411768 | biostudies-literature | 2017 May

REPOSITORIES: biostudies-literature

altmetric image

Publications

Fast and accurate de novo genome assembly from long uncorrected reads.

Vaser Robert R   Sović Ivan I   Nagarajan Niranjan N   Šikić Mile M  

Genome research 20170118 5


The assembly of long reads from Pacific Biosciences and Oxford Nanopore Technologies typically requires resource-intensive error-correction and consensus-generation steps to obtain high-quality assemblies. We show that the error-correction step can be omitted and that high-quality consensus sequences can be generated efficiently with a SIMD-accelerated, partial-order alignment-based, stand-alone consensus module called Racon. Based on tests with PacBio and Oxford Nanopore data sets, we show that  ...[more]

Similar Datasets

| S-EPMC5770995 | biostudies-literature
| S-EPMC5765664 | biostudies-literature
| S-EPMC8590762 | biostudies-literature
| S-EPMC3158087 | biostudies-literature
| S-EPMC6925183 | biostudies-literature
| S-EPMC8549298 | biostudies-literature
| S-EPMC7488116 | biostudies-literature
| S-EPMC5543108 | biostudies-literature
| S-EPMC6487145 | biostudies-literature
| S-EPMC8085491 | biostudies-literature