Unknown

Dataset Information

0

DOPA: GPU-based protein alignment using database and memory access optimizations.


ABSTRACT: BACKGROUND:Smith-Waterman (S-W) algorithm is an optimal sequence alignment method for biological databases, but its computational complexity makes it too slow for practical purposes. Heuristics based approximate methods like FASTA and BLAST provide faster solutions but at the cost of reduced accuracy. Also, the expanding volume and varying lengths of sequences necessitate performance efficient restructuring of these databases. Thus to come up with an accurate and fast solution, it is highly desired to speed up the S-W algorithm. FINDINGS:This paper presents a high performance protein sequence alignment implementation for Graphics Processing Units (GPUs). The new implementation improves performance by optimizing the database organization and reducing the number of memory accesses to eliminate bandwidth bottlenecks. The implementation is called Database Optimized Protein Alignment (DOPA) and it achieves a performance of 21.4 Giga Cell Updates Per Second (GCUPS), which is 1.13 times better than the fastest GPU implementation to date. CONCLUSIONS:In the new GPU-based implementation for protein sequence alignment (DOPA), the database is organized in equal length sequence sets. This equally distributes the workload among all the threads on the GPU's multiprocessors. The result is an improved performance which is better than the fastest available GPU implementation.

SUBMITTER: Hasan L 

PROVIDER: S-EPMC3166271 | biostudies-literature | 2011 Jul

REPOSITORIES: biostudies-literature

altmetric image

Publications

DOPA: GPU-based protein alignment using database and memory access optimizations.

Hasan Laiq L   Kentie Marijn M   Al-Ars Zaid Z  

BMC research notes 20110728


<h4>Background</h4>Smith-Waterman (S-W) algorithm is an optimal sequence alignment method for biological databases, but its computational complexity makes it too slow for practical purposes. Heuristics based approximate methods like FASTA and BLAST provide faster solutions but at the cost of reduced accuracy. Also, the expanding volume and varying lengths of sequences necessitate performance efficient restructuring of these databases. Thus to come up with an accurate and fast solution, it is hig  ...[more]

Similar Datasets

| S-EPMC5749749 | biostudies-literature
| S-EPMC5691400 | biostudies-literature
| S-EPMC2279924 | biostudies-literature
| S-EPMC6061805 | biostudies-literature
| S-EPMC3413391 | biostudies-literature
| S-EPMC139977 | biostudies-literature
| S-EPMC308761 | biostudies-literature
| S-EPMC3637623 | biostudies-literature
| S-EPMC4699916 | biostudies-literature