Dataset Information

Improving read mapping using additional prefix grams.

ABSTRACT:

Background

Next-generation sequencing (NGS) enables rapid production of billions of bases at a relatively low cost. Mapping reads from next-generation sequencers to a given reference genome is an important first step in many sequencing applications. Popular read mappers, such as Bowtie and BWA, are optimized to return top one or a few candidate locations of each read. However, identifying all mapping locations of each read, instead of just one or a few, is also important in some sequencing applications such as ChIP-seq for discovering binding sites in repeat regions, and RNA-seq for transcript abundance estimation.

Results

Here we present Hobbes2, a software package designed for fast and accurate alignment of NGS reads and specialized in identifying all mapping locations of each read. Hobbes2 efficiently identifies all mapping locations of reads using a novel technique that utilizes additional prefix q-grams to improve filtering. We extensively compare Hobbes2 with state-of-the-art read mappers, and show that Hobbes2 can be an order of magnitude faster than other read mappers while consuming less memory space and achieving similar accuracy.

Conclusions

We propose Hobbes2 to improve the accuracy of read mapping, specialized in identifying all mapping locations of each read. Hobbes2 is implemented in C++, and the source code is freely available for download at http://hobbes.ics.uci.edu.

SUBMITTER: Kim J

PROVIDER: S-EPMC3927682 | biostudies-literature | 2014 Feb

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

Improving read mapping using additional prefix grams.

Kim Jongik J Li Chen C Xie Xiaohui X

BMC bioinformatics 20140205

<h4>Background</h4>Next-generation sequencing (NGS) enables rapid production of billions of bases at a relatively low cost. Mapping reads from next-generation sequencers to a given reference genome is an important first step in many sequencing applications. Popular read mappers, such as Bowtie and BWA, are optimized to return top one or a few candidate locations of each read. However, identifying all mapping locations of each read, instead of just one or a few, is also important in some sequenci ...[more]

PMID: 24499321

Dataset Information

Improving read mapping using additional prefix grams.

Background

Results

Conclusions

Publications

Improving read mapping using additional prefix grams.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

Improving ancient DNA read mapping against modern reference genomes.
| S-EPMC3468387 | biostudies-literature

Improving in-silico normalization using read weights.
| S-EPMC6435659 | biostudies-literature

Long-read mapping to repetitive reference sequences using Winnowmap2.
| S-EPMC10510034 | biostudies-literature

Improving and correcting the contiguity of long-read genome assemblies of three plant species using optical mapping and chromosome conformation capture data.
| S-EPMC5411772 | biostudies-literature

Smoother: on-the-fly processing of interactome data using prefix sums.
| S-EPMC10954447 | biostudies-literature

Improving PacBio long read accuracy by short read alignment.
| S-EPMC3464235 | biostudies-literature

Sensitive gene fusion detection using ambiguously mapping RNA-Seq read pairs.
| S-EPMC3072550 | biostudies-literature

Ultra-deep mutant spectrum profiling: improving sequencing accuracy using overlapping read pairs.
| S-EPMC3599684 | biostudies-literature

Using quality scores and longer reads improves accuracy of Solexa read mapping.
| S-EPMC2335322 | biostudies-literature

Improving the ostrich genome assembly using optical mapping data.
| S-EPMC4427950 | biostudies-literature