Dataset Information

Simple tools for assembling and searching high-density picolitre pyrophosphate sequence data.

ABSTRACT:

Background

The advent of pyrophosphate sequencing makes large volumes of sequencing data available at a lower cost than previously possible. However, the short read lengths are difficult to assemble and the large dataset is difficult to handle. During the sequencing of a virus from the tsetse fly, Glossina pallidipes, we found the need for tools to search quickly a set of reads for near exact text matches.

Methods

A set of tools is provided to search a large data set of pyrophosphate sequence reads under a "live" CD version of Linux on a standard PC that can be used by anyone without prior knowledge of Linux and without having to install a Linux setup on the computer. The tools permit short lengths of de novo assembly, checking of existing assembled sequences, selection and display of reads from the data set and gathering counts of sequences in the reads.

Results

Demonstrations are given of the use of the tools to help with checking an assembly against the fragment data set; investigating homopolymer lengths, repeat regions and polymorphisms; and resolving inserted bases caused by incomplete chain extension.

Conclusion

The additional information contained in a pyrophosphate sequencing data set beyond a basic assembly is difficult to access due to a lack of tools. The set of simple tools presented here would allow anyone with basic computer skills and a standard PC to access this information.

SUBMITTER: Parker NJ

PROVIDER: S-EPMC2374781 | biostudies-literature | 2008 Apr

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

Simple tools for assembling and searching high-density picolitre pyrophosphate sequence data.

Parker Nicolas J NJ Parker Andrew G AG

Source code for biology and medicine 20080418

<h4>Background</h4>The advent of pyrophosphate sequencing makes large volumes of sequencing data available at a lower cost than previously possible. However, the short read lengths are difficult to assemble and the large dataset is difficult to handle. During the sequencing of a virus from the tsetse fly, Glossina pallidipes, we found the need for tools to search quickly a set of reads for near exact text matches.<h4>Methods</h4>A set of tools is provided to search a large data set of pyrophosph ...[more]

PMID: 18423012

Dataset Information

Simple tools for assembling and searching high-density picolitre pyrophosphate sequence data.

Background

Methods

Results

Conclusion

Publications

Simple tools for assembling and searching high-density picolitre pyrophosphate sequence data.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

Genome sequencing in microfabricated high-density picolitre reactors.
| S-EPMC1464427 | biostudies-literature

A high-density simple sequence repeat-based genetic linkage map of switchgrass.
| S-EPMC3291506 | biostudies-other

Next-generation high-density self-assembling functional protein arrays.
| S-EPMC3070491 | biostudies-literature

Metavisitor, a Suite of Galaxy Tools for Simple and Rapid Detection and Discovery of Viruses in Deep Sequence Data.
| S-EPMC5207757 | biostudies-literature

A high-density simple sequence repeat and single nucleotide polymorphism genetic map of the tetraploid cotton genome.
| S-EPMC3276184 | biostudies-literature

Kraken: a set of tools for quality control and analysis of high-throughput sequence data.
| S-EPMC3991327 | biostudies-literature

Assembling carbon quantum dots to a layered carbon for high-density supercapacitor electrodes.
| S-EPMC4709517 | biostudies-literature

EpiBuilder: A Tool for Assembling, Searching, and Classifying B-Cell Epitopes.
| S-EPMC9102138 | biostudies-literature

Construction of an integrated high density simple sequence repeat linkage map in cultivated strawberry (Fragaria × ananassa) and its applicability.
| S-EPMC3576660 | biostudies-literature

Development of ultra-high-density screening tools for microbial "omics".
| S-EPMC3897414 | biostudies-literature