Unknown

Dataset Information

0

Fulcrum: condensing redundant reads from high-throughput sequencing studies.


ABSTRACT:

Motivation

Ultra-high-throughput sequencing produces duplicate and near-duplicate reads, which can consume computational resources in downstream applications. A tool that collapses such reads should reduce storage and assembly complications and costs.

Results

We developed Fulcrum to collapse identical and near-identical Illumina and 454 reads (such as those from PCR clones) into single error-corrected sequences; it can process paired-end as well as single-end reads. Fulcrum is customizable and can be deployed on a single machine, a local network or a commercially available MapReduce cluster, and it has been optimized to maximize ease-of-use, cross-platform compatibility and future scalability. Sequence datasets have been collapsed by up to 71%, and the reduced number and improved quality of the resulting sequences allow assemblers to produce longer contigs while using less memory.

SUBMITTER: Burriesci MS 

PROVIDER: S-EPMC3348557 | biostudies-literature | 2012 May

REPOSITORIES: biostudies-literature

altmetric image

Publications

Fulcrum: condensing redundant reads from high-throughput sequencing studies.

Burriesci Matthew S MS   Lehnert Erik M EM   Pringle John R JR  

Bioinformatics (Oxford, England) 20120313 10


<h4>Motivation</h4>Ultra-high-throughput sequencing produces duplicate and near-duplicate reads, which can consume computational resources in downstream applications. A tool that collapses such reads should reduce storage and assembly complications and costs.<h4>Results</h4>We developed Fulcrum to collapse identical and near-identical Illumina and 454 reads (such as those from PCR clones) into single error-corrected sequences; it can process paired-end as well as single-end reads. Fulcrum is cus  ...[more]

Similar Datasets

| S-EPMC4895285 | biostudies-literature
| S-EPMC6365934 | biostudies-literature
| S-EPMC3119603 | biostudies-literature
| S-EPMC4168710 | biostudies-literature
| S-EPMC3848435 | biostudies-literature
| S-EPMC4117746 | biostudies-literature
| S-EPMC4071206 | biostudies-literature
| S-EPMC4948903 | biostudies-literature
| S-EPMC5845169 | biostudies-literature
| S-EPMC4133078 | biostudies-literature