Dataset Information

DecGPU: distributed error correction on massively parallel graphics processing units using CUDA and MPI.

ABSTRACT:

Background

Next-generation sequencing technologies have led to the high-throughput production of sequence data (reads) at low cost. However, these reads are significantly shorter and more error-prone than conventional Sanger shotgun reads. This poses a challenge for the de novo assembly in terms of assembly quality and scalability for large-scale short read datasets.

Results

We present DecGPU, the first parallel and distributed error correction algorithm for high-throughput short reads (HTSRs) using a hybrid combination of CUDA and MPI parallel programming models. DecGPU provides CPU-based and GPU-based versions, where the CPU-based version employs coarse-grained and fine-grained parallelism using the MPI and OpenMP parallel programming models, and the GPU-based version takes advantage of the CUDA and MPI parallel programming models and employs a hybrid CPU+GPU computing model to maximize the performance by overlapping the CPU and GPU computation. The distributed feature of our algorithm makes it feasible and flexible for the error correction of large-scale HTSR datasets. Using simulated and real datasets, our algorithm demonstrates superior performance, in terms of error correction quality and execution speed, to the existing error correction algorithms. Furthermore, when combined with Velvet and ABySS, the resulting DecGPU-Velvet and DecGPU-ABySS assemblers demonstrate the potential of our algorithm to improve de novo assembly quality for de-Bruijn-graph-based assemblers.

Conclusions

DecGPU is publicly available open-source software, written in CUDA C++ and MPI. The experimental results suggest that DecGPU is an effective and feasible error correction algorithm to tackle the flood of short reads produced by next-generation sequencing technologies.

SUBMITTER: Liu Y

PROVIDER: S-EPMC3072957 | biostudies-literature | 2011 Mar

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

DecGPU: distributed error correction on massively parallel graphics processing units using CUDA and MPI.

Liu Yongchao Y Schmidt Bertil B Maskell Douglas L DL

BMC bioinformatics 20110329

<h4>Background</h4>Next-generation sequencing technologies have led to the high-throughput production of sequence data (reads) at low cost. However, these reads are significantly shorter and more error-prone than conventional Sanger shotgun reads. This poses a challenge for the de novo assembly in terms of assembly quality and scalability for large-scale short read datasets.<h4>Results</h4>We present DecGPU, the first parallel and distributed error correction algorithm for high-throughput short ...[more]

PMID: 21447171

Dataset Information

DecGPU: distributed error correction on massively parallel graphics processing units using CUDA and MPI.

Background

Results

Conclusions

Publications

DecGPU: distributed error correction on massively parallel graphics processing units using CUDA and MPI.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

CUDASW++: optimizing Smith-Waterman sequence database searches for CUDA-enabled graphics processing units.
| S-EPMC2694204 | biostudies-literature

Accelerating the Gillespie Exact Stochastic Simulation Algorithm using hybrid parallel execution on graphics processing units.
| S-EPMC3494724 | biostudies-literature

High-throughput sequence alignment using Graphics Processing Units.
| S-EPMC2222658 | biostudies-literature

Fast docking on graphics processing units via Ray-Casting.
| S-EPMC3745428 | biostudies-literature

dadi.CUDA: Accelerating Population Genetics Inference with Graphics Processing Units.
| S-EPMC8097298 | biostudies-literature

permGPU: Using graphics processing units in RNA microarray association studies.
| S-EPMC2910023 | biostudies-literature

Graphics processing units in bioinformatics, computational biology and systems biology.
| S-EPMC5862309 | biostudies-literature

Accelerating large-scale protein structure alignments with graphics processing units.
| S-EPMC3309952 | biostudies-literature

Mendel-GPU: haplotyping and genotype imputation on graphics processing units.
| S-EPMC3634317 | biostudies-literature

Accelerating the Gillespie τ-Leaping Method using graphics processing units.
| S-EPMC3371023 | biostudies-literature