Unknown

Dataset Information

0

EBIC: an open source software for high-dimensional and big data analyses.


ABSTRACT: MOTIVATION:In this paper, we present an open source package with the latest release of Evolutionary-based BIClustering (EBIC), a next-generation biclustering algorithm for mining genetic data. The major contribution of this paper is adding a full support for multiple graphics processing units (GPUs) support, which makes it possible to run efficiently large genomic data mining analyses. Multiple enhancements to the first release of the algorithm include integration with R and Bioconductor, and an option to exclude missing values from the analysis. RESULTS:Evolutionary-based BIClustering was applied to datasets of different sizes, including a large DNA methylation dataset with 436 444 rows. For the largest dataset we observed over 6.6-fold speedup in computation time on a cluster of eight GPUs compared to running the method on a single GPU. This proves high scalability of the method. AVAILABILITY AND IMPLEMENTATION:The latest version of EBIC could be downloaded from http://github.com/EpistasisLab/ebic. Installation and usage instructions are also available online. SUPPLEMENTARY INFORMATION:Supplementary data are available at Bioinformatics online.

SUBMITTER: Orzechowski P 

PROVIDER: S-EPMC6736067 | biostudies-literature | 2019 Sep

REPOSITORIES: biostudies-literature

altmetric image

Publications

EBIC: an open source software for high-dimensional and big data analyses.

Orzechowski Patryk P   Moore Jason H JH  

Bioinformatics (Oxford, England) 20190901 17


<h4>Motivation</h4>In this paper, we present an open source package with the latest release of Evolutionary-based BIClustering (EBIC), a next-generation biclustering algorithm for mining genetic data. The major contribution of this paper is adding a full support for multiple graphics processing units (GPUs) support, which makes it possible to run efficiently large genomic data mining analyses. Multiple enhancements to the first release of the algorithm include integration with R and Bioconductor  ...[more]

Similar Datasets

| S-EPMC5530315 | biostudies-literature
| S-EPMC6609732 | biostudies-literature
| S-EPMC9047447 | biostudies-literature
| S-EPMC7644112 | biostudies-literature
| S-EPMC8321150 | biostudies-literature
| S-EPMC10079087 | biostudies-literature
| S-EPMC7228781 | biostudies-literature
| S-EPMC7336365 | biostudies-literature
| S-EPMC8209617 | biostudies-literature
| S-EPMC4387197 | biostudies-literature