Unknown

Dataset Information

0

Shouji: a fast and efficient pre-alignment filter for sequence alignment.


ABSTRACT: MOTIVATION:The ability to generate massive amounts of sequencing data continues to overwhelm the processing capability of existing algorithms and compute infrastructures. In this work, we explore the use of hardware/software co-design and hardware acceleration to significantly reduce the execution time of short sequence alignment, a crucial step in analyzing sequenced genomes. We introduce Shouji, a highly parallel and accurate pre-alignment filter that remarkably reduces the need for computationally-costly dynamic programming algorithms. The first key idea of our proposed pre-alignment filter is to provide high filtering accuracy by correctly detecting all common subsequences shared between two given sequences. The second key idea is to design a hardware accelerator that adopts modern field-programmable gate array (FPGA) architectures to further boost the performance of our algorithm. RESULTS:Shouji significantly improves the accuracy of pre-alignment filtering by up to two orders of magnitude compared to the state-of-the-art pre-alignment filters, GateKeeper and SHD. Our FPGA-based accelerator is up to three orders of magnitude faster than the equivalent CPU implementation of Shouji. Using a single FPGA chip, we benchmark the benefits of integrating Shouji with five state-of-the-art sequence aligners, designed for different computing platforms. The addition of Shouji as a pre-alignment step reduces the execution time of the five state-of-the-art sequence aligners by up to 18.8×. Shouji can be adapted for any bioinformatics pipeline that performs sequence alignment for verification. Unlike most existing methods that aim to accelerate sequence alignment, Shouji does not sacrifice any of the aligner capabilities, as it does not modify or replace the alignment step. AVAILABILITY AND IMPLEMENTATION:https://github.com/CMU-SAFARI/Shouji. SUPPLEMENTARY INFORMATION:Supplementary data are available at Bioinformatics online.

SUBMITTER: Alser M 

PROVIDER: S-EPMC6821304 | biostudies-literature | 2019 Nov

REPOSITORIES: biostudies-literature

altmetric image

Publications

Shouji: a fast and efficient pre-alignment filter for sequence alignment.

Alser Mohammed M   Hassan Hasan H   Kumar Akash A   Mutlu Onur O   Alkan Can C  

Bioinformatics (Oxford, England) 20191101 21


<h4>Motivation</h4>The ability to generate massive amounts of sequencing data continues to overwhelm the processing capability of existing algorithms and compute infrastructures. In this work, we explore the use of hardware/software co-design and hardware acceleration to significantly reduce the execution time of short sequence alignment, a crucial step in analyzing sequenced genomes. We introduce Shouji, a highly parallel and accurate pre-alignment filter that remarkably reduces the need for co  ...[more]

Similar Datasets

| S-EPMC8313605 | biostudies-literature
| S-EPMC3638164 | biostudies-other
| S-EPMC1579236 | biostudies-literature
| S-EPMC4080745 | biostudies-literature
| S-EPMC2893182 | biostudies-literature
| S-EPMC3999979 | biostudies-literature
| S-EPMC5037421 | biostudies-literature
| S-EPMC3934876 | biostudies-literature
| S-EPMC3495709 | biostudies-literature
| S-EPMC4382095 | biostudies-literature