ABSTRACT:
SUBMITTER: Seiler E
PROVIDER: S-EPMC8313605 | biostudies-literature | 2021 Jul
REPOSITORIES: biostudies-literature
Enrico Seiler, Svenja Mehringer, Mitra Darvish, Etienne Turc, Knut Reinert
iScience, 2021-06-24, issue 7
We present Raptor, a system for approximately searching many queries, such as next-generation sequencing reads or transcripts, in large collections of nucleotide sequences. Raptor combines three components: winnowing minimizers to define a set of representative k-mers, an extension of the interleaved Bloom filter (IBF) as a set membership data structure, and probabilistic thresholding for minimizers. Our approach allows compression and partitioning of the IBF to enable the effective use of secondary memory. We ...
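The ideas named in the abstract can be illustrated with a minimal Python sketch. It is not Raptor's implementation. The names `minimizers`, `ToyIBF`, and `candidate_bins` are hypothetical. The sketch orders k-mers lexicographically, hashes with SHA-256, and uses a fixed count threshold; these are simplifying assumptions standing in for the hashing scheme and the probabilistic threshold that the paper describes.

```python
import hashlib


def kmers(seq, k):
    """All overlapping substrings of length k."""
    return [seq[i:i + k] for i in range(len(seq) - k + 1)]


def minimizers(seq, k, w):
    """Set of window minimizers: the smallest k-mer in every window of
    w consecutive k-mers. Lexicographic order is an assumption made
    for readability; real tools typically order by a hash value."""
    ks = kmers(seq, k)
    if not ks:
        return set()
    n_windows = max(1, len(ks) - w + 1)
    return {min(ks[i:i + w]) for i in range(n_windows)}


class ToyIBF:
    """Toy interleaved Bloom filter: one Bloom filter per bin, stored so
    that the bits of all bins for a given slot sit together. Here each
    slot is a Python int whose bit j belongs to bin j."""

    def __init__(self, n_bins, n_slots, n_hashes=2):
        self.n_bins = n_bins
        self.n_slots = n_slots
        self.n_hashes = n_hashes
        self.rows = [0] * n_slots

    def _slots(self, item):
        # Derive n_hashes slot indices from seeded SHA-256 digests.
        for seed in range(self.n_hashes):
            digest = hashlib.sha256(f"{seed}:{item}".encode()).digest()
            yield int.from_bytes(digest[:8], "big") % self.n_slots

    def insert(self, item, bin_id):
        for s in self._slots(item):
            self.rows[s] |= 1 << bin_id

    def query(self, item):
        """Bitmask of bins that may contain item (false positives are
        possible, false negatives are not)."""
        mask = (1 << self.n_bins) - 1
        for s in self._slots(item):
            mask &= self.rows[s]
        return mask


def candidate_bins(ibf, query, k, w, threshold):
    """Bins in which at least `threshold` minimizers of the query are
    reported present. A fixed threshold is a placeholder assumption."""
    counts = [0] * ibf.n_bins
    for m in minimizers(query, k, w):
        mask = ibf.query(m)
        for b in range(ibf.n_bins):
            counts[b] += (mask >> b) & 1
    return [b for b, c in enumerate(counts) if c >= threshold]
```

To use the sketch, insert the minimizers of each reference sequence into its own bin, then call `candidate_bins` for each query. Because a single lookup returns one bit per bin, all bins are tested at once, which is the property that makes the interleaved layout attractive for searching many queries against many sequences.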