Unknown

Dataset Information

0

Clustermq enables efficient parallelization of genomic analyses.


ABSTRACT:

Motivation

High performance computing (HPC) clusters play a pivotal role in large-scale bioinformatics analysis and modeling. For the statistical computing language R, packages exist to enable a user to submit their analyses as jobs on HPC schedulers. However, these packages do not scale well to high numbers of tasks, and their processing overhead quickly becomes a prohibitive bottleneck.

Results

Here we present clustermq, an R package that can process analyses up to three orders of magnitude faster than previously published alternatives. We show this for investigating genomic associations of drug sensitivity in cancer cell lines, but it can be applied to any kind of parallelizable workflow.

Availability and implementation

The package is available on CRAN and https://github.com/mschubert/clustermq. Code for performance testing is available at https://github.com/mschubert/clustermq-performance.

Supplementary information

Supplementary data are available at Bioinformatics online.

SUBMITTER: Schubert M 

PROVIDER: S-EPMC6821287 | biostudies-literature | 2019 Nov

REPOSITORIES: biostudies-literature

altmetric image

Publications

clustermq enables efficient parallelization of genomic analyses.

Schubert Michael M  

Bioinformatics (Oxford, England) 20191101 21


<h4>Motivation</h4>High performance computing (HPC) clusters play a pivotal role in large-scale bioinformatics analysis and modeling. For the statistical computing language R, packages exist to enable a user to submit their analyses as jobs on HPC schedulers. However, these packages do not scale well to high numbers of tasks, and their processing overhead quickly becomes a prohibitive bottleneck.<h4>Results</h4>Here we present clustermq, an R package that can process analyses up to three orders  ...[more]

Similar Datasets

| S-EPMC7313716 | biostudies-literature
| S-EPMC7182190 | biostudies-literature
| S-EPMC2713274 | biostudies-literature
| S-EPMC4069620 | biostudies-literature
| S-EPMC10005600 | biostudies-literature
| S-EPMC3412928 | biostudies-literature
| S-EPMC5507853 | biostudies-other
| S-EPMC6003447 | biostudies-literature
| S-EPMC3324005 | biostudies-literature
| S-EPMC8326735 | biostudies-literature