
Dataset Information


PuffAligner: A Fast, Efficient, and Accurate Aligner Based on the Pufferfish Index.


ABSTRACT:

Motivation

Sequence alignment is one of the first steps in many modern genomic analyses, such as variant detection, transcript abundance estimation and metagenomic profiling. Unfortunately, it is often a computationally expensive procedure. As the quantity of data and wealth of different assays and applications continue to grow, the need for accurate and fast alignment tools that scale to large collections of reference sequences persists.

Results

In this paper, we introduce PuffAligner, a fast, accurate and versatile aligner built on top of the Pufferfish index. PuffAligner is able to produce highly sensitive alignments, similar to those of Bowtie2, but much more quickly. While exhibiting similar speed to the ultrafast STAR aligner, PuffAligner requires considerably less memory to construct its index and align reads. PuffAligner strikes a desirable balance with respect to the time, space, and accuracy tradeoffs made by different alignment tools, and provides a promising foundation on which to test new alignment ideas over large collections of sequences.

Availability

PuffAligner is free and open-source software. It is implemented in C++14 and can be obtained from https://github.com/COMBINE-lab/pufferfish/tree/cigar-strings.

Supplementary information

Supplementary data are available at Bioinformatics online.

SUBMITTER: Almodaresi F 

PROVIDER: S-EPMC9502150 | biostudies-literature | 2021 Jun

REPOSITORIES: biostudies-literature


Publications

PuffAligner: a fast, efficient and accurate aligner based on the Pufferfish index.

Almodaresi F, Zakeri M, Patro R

Bioinformatics (Oxford, England), 2021 Nov 1, issue 22



Similar Datasets

S-EPMC3669295 | biostudies-literature
S-EPMC3634467 | biostudies-literature
S-EPMC3664803 | biostudies-literature
S-EPMC2732315 | biostudies-literature
S-EPMC6913027 | biostudies-literature
S-EPMC5845352 | biostudies-literature
S-EPMC4907389 | biostudies-literature
S-EPMC7750957 | biostudies-literature
S-EPMC3548894 | biostudies-literature
S-EPMC6280799 | biostudies-literature