Unknown

Dataset Information

0

Reproducible Bioconductor workflows using browser-based interactive notebooks and containers.


ABSTRACT: Objective:Bioinformatics publications typically include complex software workflows that are difficult to describe in a manuscript. We describe and demonstrate the use of interactive software notebooks to document and distribute bioinformatics research. We provide a user-friendly tool, BiocImageBuilder, that allows users to easily distribute their bioinformatics protocols through interactive notebooks uploaded to either a GitHub repository or a private server. Materials and methods:We present four different interactive Jupyter notebooks using R and Bioconductor workflows to infer differential gene expression, analyze cross-platform datasets, process RNA-seq data and KinomeScan data. These interactive notebooks are available on GitHub. The analytical results can be viewed in a browser. Most importantly, the software contents can be executed and modified. This is accomplished using Binder, which runs the notebook inside software containers, thus avoiding the need to install any software and ensuring reproducibility. All the notebooks were produced using custom files generated by BiocImageBuilder. Results:BiocImageBuilder facilitates the publication of workflows with a point-and-click user interface. We demonstrate that interactive notebooks can be used to disseminate a wide range of bioinformatics analyses. The use of software containers to mirror the original software environment ensures reproducibility of results. Parameters and code can be dynamically modified, allowing for robust verification of published results and encouraging rapid adoption of new methods. Conclusion:Given the increasing complexity of bioinformatics workflows, we anticipate that these interactive software notebooks will become as necessary for documenting software methods as traditional laboratory notebooks have been for documenting bench protocols, and as ubiquitous.

SUBMITTER: Almugbel R 

PROVIDER: S-EPMC6381817 | biostudies-literature | 2018 Jan

REPOSITORIES: biostudies-literature

altmetric image

Publications

Reproducible Bioconductor workflows using browser-based interactive notebooks and containers.

Almugbel Reem R   Hung Ling-Hong LH   Hu Jiaming J   Almutairy Abeer A   Ortogero Nicole N   Tamta Yashaswi Y   Yeung Ka Yee KY  

Journal of the American Medical Informatics Association : JAMIA 20180101 1


<h4>Objective</h4>Bioinformatics publications typically include complex software workflows that are difficult to describe in a manuscript. We describe and demonstrate the use of interactive software notebooks to document and distribute bioinformatics research. We provide a user-friendly tool, BiocImageBuilder, that allows users to easily distribute their bioinformatics protocols through interactive notebooks uploaded to either a GitHub repository or a private server.<h4>Materials and methods</h4  ...[more]

Similar Datasets

| S-EPMC2935447 | biostudies-literature
| S-EPMC6223375 | biostudies-literature
| S-EPMC8504628 | biostudies-literature
| S-EPMC6031024 | biostudies-literature
| S-EPMC6075087 | biostudies-other
| S-EPMC3929842 | biostudies-literature
| S-EPMC4387895 | biostudies-literature
| S-EPMC10053428 | biostudies-literature
| S-EPMC6021116 | biostudies-literature
| S-EPMC10436054 | biostudies-literature