Unknown

Dataset Information

0

GAGE-B: an evaluation of genome assemblers for bacterial organisms.


ABSTRACT: MOTIVATION:A large and rapidly growing number of bacterial organisms have been sequenced by the newest sequencing technologies. Cheaper and faster sequencing technologies make it easy to generate very high coverage of bacterial genomes, but these advances mean that DNA preparation costs can exceed the cost of sequencing for small genomes. The need to contain costs often results in the creation of only a single sequencing library, which in turn introduces new challenges for genome assembly methods. RESULTS:We evaluated the ability of multiple genome assembly programs to assemble bacterial genomes from a single, deep-coverage library. For our comparison, we chose bacterial species spanning a wide range of GC content and measured the contiguity and accuracy of the resulting assemblies. We compared the assemblies produced by this very high-coverage, one-library strategy to the best assemblies created by two-library sequencing, and we found that remarkably good bacterial assemblies are possible with just one library. We also measured the effect of read length and depth of coverage on assembly quality and determined the values that provide the best results with current algorithms. CONTACT:salzberg@jhu.edu SUPPLEMENTARY INFORMATION:Supplementary data are available at Bioinformatics online.

SUBMITTER: Magoc T 

PROVIDER: S-EPMC3702249 | biostudies-literature | 2013 Jul

REPOSITORIES: biostudies-literature

altmetric image

Publications

GAGE-B: an evaluation of genome assemblers for bacterial organisms.

Magoc Tanja T   Pabinger Stephan S   Canzar Stefan S   Liu Xinyue X   Su Qi Q   Puiu Daniela D   Tallon Luke J LJ   Salzberg Steven L SL  

Bioinformatics (Oxford, England) 20130510 14


<h4>Motivation</h4>A large and rapidly growing number of bacterial organisms have been sequenced by the newest sequencing technologies. Cheaper and faster sequencing technologies make it easy to generate very high coverage of bacterial genomes, but these advances mean that DNA preparation costs can exceed the cost of sequencing for small genomes. The need to contain costs often results in the creation of only a single sequencing library, which in turn introduces new challenges for genome assembl  ...[more]

Similar Datasets

| S-EPMC3290791 | biostudies-literature
| S-EPMC6966772 | biostudies-literature
| S-EPMC3785575 | biostudies-literature
| S-EPMC7462071 | biostudies-literature
| S-EPMC7730629 | biostudies-literature
| S-EPMC9464240 | biostudies-literature
| S-EPMC5054208 | biostudies-literature
| S-EPMC11323187 | biostudies-literature
| S-EPMC5826002 | biostudies-literature
| S-EPMC1150276 | biostudies-literature