Dataset Information

ReadBouncer: precise and scalable adaptive sampling for nanopore sequencing.


ABSTRACT:

Motivation

Nanopore sequencers allow targeted sequencing of nucleotide sequences of interest by rejecting other sequences from individual pores. This feature facilitates the enrichment of low-abundance sequences by depleting overrepresented ones in silico. Existing tools for adaptive sampling either apply signal alignment, which cannot handle human-sized reference sequences, or apply read mapping in sequence space, relying on fast graphics processing unit (GPU) base callers for real-time read rejection. Nanopore long-read mapping tools are also suboptimal for the shorter reads typically analyzed in adaptive sampling applications.

Results

Here, we present a new approach for nanopore adaptive sampling that combines fast CPU and GPU base calling with read classification based on Interleaved Bloom Filters. ReadBouncer improves the potential enrichment of low-abundance sequences through its high read classification sensitivity and specificity, outperforming existing tools in the field. It robustly removes reads even when they belong to large reference sequences, and it runs on commodity hardware without GPUs, making adaptive sampling accessible for in-field researchers. ReadBouncer also provides a user-friendly interface and installer files for end users without a bioinformatics background.
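
As a rough illustration of the idea behind classifying reads with an Interleaved Bloom Filter, the following Python sketch stores one Bloom filter per reference bin in a shared row-wise table and assigns a read to every bin that contains a sufficient fraction of its k-mers. This is a toy model under stated assumptions, not ReadBouncer's implementation: the class and function names, the k-mer size, the number of hash functions, the table size, and the `min_fraction` threshold are all hypothetical choices made for the example.

```python
import hashlib


class InterleavedBloomFilter:
    """Toy interleaved Bloom filter: one Bloom filter per bin, stored row-wise."""

    def __init__(self, num_bins, rows, num_hashes=3):
        self.num_bins = num_bins
        self.rows = rows
        self.num_hashes = num_hashes
        # table[i] is an integer; bit j is set if bin j has Bloom position i set.
        self.table = [0] * rows

    def _positions(self, kmer):
        # Derive num_hashes row indices from salted BLAKE2b digests (illustrative choice).
        for seed in range(self.num_hashes):
            digest = hashlib.blake2b(
                kmer.encode(), digest_size=8, salt=bytes([seed])
            ).digest()
            yield int.from_bytes(digest, "big") % self.rows

    def insert(self, kmer, bin_id):
        for pos in self._positions(kmer):
            self.table[pos] |= 1 << bin_id

    def query(self, kmer):
        # AND the rows for all hash positions: the result is a bitmask of
        # bins that may contain the k-mer (false positives are possible).
        mask = (1 << self.num_bins) - 1
        for pos in self._positions(kmer):
            mask &= self.table[pos]
        return mask


def kmers(seq, k):
    """Yield all overlapping substrings of length k."""
    return (seq[i:i + k] for i in range(len(seq) - k + 1))


def build_ibf(references, k, rows):
    """Insert the k-mers of each reference sequence into its own bin."""
    ibf = InterleavedBloomFilter(len(references), rows)
    for bin_id, seq in enumerate(references):
        for kmer in kmers(seq, k):
            ibf.insert(kmer, bin_id)
    return ibf


def classify_read(ibf, read, k, min_fraction=0.8):
    """Return the bins whose filters contain at least min_fraction of the read's k-mers."""
    counts = [0] * ibf.num_bins
    total = 0
    for kmer in kmers(read, k):
        total += 1
        hits = ibf.query(kmer)
        for b in range(ibf.num_bins):
            if (hits >> b) & 1:
                counts[b] += 1
    if total == 0:
        return []
    return [b for b, c in enumerate(counts) if c / total >= min_fraction]
```

In an adaptive sampling setting, a read prefix that matches a depletion bin would be a candidate for rejection from the pore. A practical classifier would additionally need to tolerate sequencing errors in the base-called prefix, which this sketch only approximates through the fraction threshold.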

Availability and implementation

The C++ source code is available at https://gitlab.com/dacs-hpi/readbouncer.

Supplementary information

Supplementary data are available at Bioinformatics online.

SUBMITTER: Ulrich JU 

PROVIDER: S-EPMC9235500 | biostudies-literature

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC8785595 | biostudies-literature
| S-EPMC7096756 | biostudies-literature
| S-EPMC8722758 | biostudies-literature
| S-EPMC5515536 | biostudies-literature
| S-EPMC8499546 | biostudies-literature
| S-EPMC9244360 | biostudies-literature
| S-EPMC5428259 | biostudies-literature
| S-EPMC2941267 | biostudies-literature
| S-EPMC7815314 | biostudies-literature
| S-EPMC7319573 | biostudies-literature