Unknown

Dataset Information

0

An architecture for genomics analysis in a clinical setting using Galaxy and Docker.


ABSTRACT: Next-generation sequencing is used on a daily basis to perform molecular analysis to determine subtypes of disease (e.g., in cancer) and to assist in the selection of the optimal treatment. Clinical bioinformatics handles the manipulation of the data generated by the sequencer, from the generation to the analysis and interpretation. Reproducibility and traceability are crucial issues in a clinical setting. We have designed an approach based on Docker container technology and Galaxy, the popular bioinformatics analysis support open-source software. Our solution simplifies the deployment of a small-size analytical platform and simplifies the process for the clinician. From the technical point of view, the tools embedded in the platform are isolated and versioned through Docker images. Along the Galaxy platform, we also introduce the AnalysisManager, a solution that allows single-click analysis for biologists and leverages standardized bioinformatics application programming interfaces. We added a Shiny/R interactive environment to ease the visualization of the outputs. The platform relies on containers and ensures the data traceability by recording analytical actions and by associating inputs and outputs of the tools to EDAM ontology through ReGaTe. The source code is freely available on Github at https://github.com/CARPEM/GalaxyDocker.

SUBMITTER: Digan W 

PROVIDER: S-EPMC5691353 | biostudies-literature | 2017 Nov

REPOSITORIES: biostudies-literature

altmetric image

Publications

An architecture for genomics analysis in a clinical setting using Galaxy and Docker.

Digan W W   Countouris H H   Barritault M M   Baudoin D D   Laurent-Puig P P   Blons H H   Burgun A A   Rance B B  

GigaScience 20171101 11


Next-generation sequencing is used on a daily basis to perform molecular analysis to determine subtypes of disease (e.g., in cancer) and to assist in the selection of the optimal treatment. Clinical bioinformatics handles the manipulation of the data generated by the sequencer, from the generation to the analysis and interpretation. Reproducibility and traceability are crucial issues in a clinical setting. We have designed an approach based on Docker container technology and Galaxy, the popular  ...[more]

Similar Datasets

| S-EPMC10132306 | biostudies-literature
| S-EPMC4669641 | biostudies-literature
| S-EPMC4203657 | biostudies-other
| PRJEB80875 | ENA
| S-EPMC5437943 | biostudies-literature
2015-07-02 | PXD001655 | Pride
| S-EPMC5333608 | biostudies-literature
| S-EPMC4261978 | biostudies-literature
| PRJNA158499 | ENA
| S-EPMC4160307 | biostudies-literature