Dataset Information

BioBloom tools: fast, accurate and memory-efficient host species sequence screening using bloom filters.


ABSTRACT: Large datasets can be screened for sequences from a specific organism, quickly and with low memory requirements, by a data structure that supports time- and memory-efficient set membership queries. Bloom filters offer such queries but require that false positives be controlled. We present BioBloom Tools, a Bloom filter-based sequence-screening tool that is faster than BWA, Bowtie 2 (popular alignment algorithms) and FACS (a membership query algorithm). It delivers accuracies comparable with these tools, controls false positives and has low memory requirements. Availability and implementation: www.bcgsc.ca/platform/bioinfo/software/biobloomtools.
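EXAMPLE (illustrative, not from the publication): The abstract's idea of screening reads with set membership queries can be sketched as a small Bloom filter over reference k-mers. This is a minimal sketch under stated assumptions and is not the BioBloom Tools implementation: the class and function names (BloomFilter, build_filter, hit_fraction), the SHA-256 double-hashing scheme, and the hit-fraction score are all hypothetical choices made for illustration. The filter size and hash count come from the standard Bloom filter sizing formulas for a target false positive rate.

```python
import hashlib
import math


class BloomFilter:
    """Bit array probed by several hash positions per item (illustrative only)."""

    def __init__(self, expected_items: int, false_positive_rate: float):
        # Standard sizing: m = -n * ln(p) / (ln 2)^2 bits, k = (m / n) * ln 2 hashes.
        n = max(1, expected_items)
        self.num_bits = max(
            1, math.ceil(-n * math.log(false_positive_rate) / (math.log(2) ** 2))
        )
        self.num_hashes = max(1, round(self.num_bits / n * math.log(2)))
        self.bits = bytearray((self.num_bits + 7) // 8)

    def _positions(self, item: str):
        # Assumption: double hashing on two 64-bit halves of one SHA-256 digest.
        digest = hashlib.sha256(item.encode()).digest()
        h1 = int.from_bytes(digest[:8], "big")
        h2 = int.from_bytes(digest[8:16], "big") | 1
        for i in range(self.num_hashes):
            yield (h1 + i * h2) % self.num_bits

    def add(self, item: str) -> None:
        for pos in self._positions(item):
            self.bits[pos // 8] |= 1 << (pos % 8)

    def __contains__(self, item: str) -> bool:
        # True may be a false positive; False is always correct.
        return all(
            self.bits[pos // 8] & (1 << (pos % 8)) for pos in self._positions(item)
        )


def kmers(sequence: str, k: int):
    """Yield every overlapping substring of length k."""
    return (sequence[i:i + k] for i in range(len(sequence) - k + 1))


def build_filter(reference: str, k: int, false_positive_rate: float) -> BloomFilter:
    """Insert all k-mers of a reference sequence into a new filter."""
    bloom = BloomFilter(len(reference) - k + 1, false_positive_rate)
    for kmer in kmers(reference, k):
        bloom.add(kmer)
    return bloom


def hit_fraction(read: str, bloom: BloomFilter, k: int) -> float:
    """Fraction of a read's k-mers found in the filter (hypothetical score)."""
    total = hits = 0
    for kmer in kmers(read, k):
        total += 1
        hits += kmer in bloom
    return hits / total if total else 0.0
```

A read would then be assigned to the reference when its hit fraction exceeds a chosen threshold. Because a Bloom filter never produces false negatives, a read copied from the reference always scores 1.0, while an unrelated read can still score above zero through false positives, which is why the false positive rate has to be controlled.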

SUBMITTER: Chu J 

PROVIDER: S-EPMC4816029 | biostudies-literature | 2014 Dec

REPOSITORIES: biostudies-literature

Publications

BioBloom tools: fast, accurate and memory-efficient host species sequence screening using bloom filters.

Justin Chu, Sara Sadeghi, Anthony Raymond, Shaun D. Jackman, Ka Ming Nip, Richard Mar, Hamid Mohamadi, Yaron S. Butterfield, A. Gordon Robertson, Inanç Birol

Bioinformatics (Oxford, England), 2014-08-20, issue 23


Similar Datasets

| S-EPMC6280799 | biostudies-other
| S-EPMC5467106 | biostudies-literature
| S-EPMC3507659 | biostudies-literature
| S-EPMC3974045 | biostudies-literature
| S-EPMC9853099 | biostudies-literature
| S-EPMC2887045 | biostudies-literature
| S-EPMC6074839 | biostudies-literature
| S-EPMC6792110 | biostudies-literature
| S-EPMC9440141 | biostudies-literature
| S-EPMC6812468 | biostudies-literature