Unknown

Dataset Information

0

Mauve assembly metrics.


ABSTRACT:

Summary

High-throughput DNA sequencing technologies have spurred the development of numerous novel methods for genome assembly. With few exceptions, these algorithms are heuristic and require one or more parameters to be manually set by the user. One approach to parameter tuning involves assembling data from an organism with an available high-quality reference genome, and measuring assembly accuracy using some metrics. We developed a system to measure assembly quality under several scoring metrics, and to compare assembly quality across a variety of assemblers, sequence data types, and parameter choices. When used in conjunction with training data such as a high-quality reference genome and sequence reads from the same organism, our program can be used to manually identify an optimal sequencing and assembly strategy for de novo sequencing of related organisms.

Availability

GPL source code and a usage tutorial is at http://ngopt.googlecode.com

Contact

aarondarling@ucdavis.edu

Supplementary information

Supplementary data is available at Bioinformatics online.

SUBMITTER: Darling AE 

PROVIDER: S-EPMC3179657 | biostudies-literature | 2011 Oct

REPOSITORIES: biostudies-literature

altmetric image

Publications

Mauve assembly metrics.

Darling Aaron E AE   Tritt Andrew A   Eisen Jonathan A JA   Facciotti Marc T MT  

Bioinformatics (Oxford, England) 20110802 19


<h4>Summary</h4>High-throughput DNA sequencing technologies have spurred the development of numerous novel methods for genome assembly. With few exceptions, these algorithms are heuristic and require one or more parameters to be manually set by the user. One approach to parameter tuning involves assembling data from an organism with an available high-quality reference genome, and measuring assembly accuracy using some metrics. We developed a system to measure assembly quality under several scori  ...[more]

Similar Datasets

| S-EPMC2723005 | biostudies-literature
| S-EPMC3733778 | biostudies-literature
| S-EPMC6819997 | biostudies-literature
| S-EPMC4932478 | biostudies-literature
| S-EPMC6030666 | biostudies-literature
| S-EPMC8024676 | biostudies-literature
2023-09-29 | GSE244325 | GEO
| S-EPMC8362059 | biostudies-literature
2023-05-10 | GSE215175 | GEO
2012-10-01 | E-GEOD-33063 | biostudies-arrayexpress