Unknown

Dataset Information

0

SSPACE-LongRead: scaffolding bacterial draft genomes using long read sequence information.


ABSTRACT:

Background

The recent introduction of the Pacific Biosciences RS single molecule sequencing technology has opened new doors to scaffolding genome assemblies in a cost-effective manner. The long read sequence information is promised to enhance the quality of incomplete and inaccurate draft assemblies constructed from Next Generation Sequencing (NGS) data.

Results

Here we propose a novel hybrid assembly methodology that aims to scaffold pre-assembled contigs in an iterative manner using PacBio RS long read information as a backbone. On a test set comprising six bacterial draft genomes, assembled using either a single Illumina MiSeq or Roche 454 library, we show that even a 50× coverage of uncorrected PacBio RS long reads is sufficient to drastically reduce the number of contigs. Comparisons to the AHA scaffolder indicate our strategy is better capable of producing (nearly) complete bacterial genomes.

Conclusions

The current work describes our SSPACE-LongRead software which is designed to upgrade incomplete draft genomes using single molecule sequences. We conclude that the recent advances of the PacBio sequencing technology and chemistry, in combination with the limited computational resources required to run our program, allow to scaffold genomes in a fast and reliable manner.

SUBMITTER: Boetzer M 

PROVIDER: S-EPMC4076250 | biostudies-literature | 2014 Jun

REPOSITORIES: biostudies-literature

altmetric image

Publications

SSPACE-LongRead: scaffolding bacterial draft genomes using long read sequence information.

Boetzer Marten M   Pirovano Walter W  

BMC bioinformatics 20140620


<h4>Background</h4>The recent introduction of the Pacific Biosciences RS single molecule sequencing technology has opened new doors to scaffolding genome assemblies in a cost-effective manner. The long read sequence information is promised to enhance the quality of incomplete and inaccurate draft assemblies constructed from Next Generation Sequencing (NGS) data.<h4>Results</h4>Here we propose a novel hybrid assembly methodology that aims to scaffold pre-assembled contigs in an iterative manner u  ...[more]

Similar Datasets

| S-EPMC5508778 | biostudies-literature
| S-EPMC4524009 | biostudies-literature
| S-EPMC8442456 | biostudies-literature
| S-EPMC6325685 | biostudies-literature
| S-EPMC6902338 | biostudies-literature
| S-EPMC6816165 | biostudies-literature
| S-EPMC8103633 | biostudies-literature
| S-EPMC3975067 | biostudies-literature
| S-EPMC8354527 | biostudies-literature
| S-EPMC4558774 | biostudies-literature