Unknown

Dataset Information

0

Jenkins-CI, an Open-Source Continuous Integration System, as a Scientific Data and Image-Processing Platform.


ABSTRACT: High-throughput screening generates large volumes of heterogeneous data that require a diverse set of computational tools for management, processing, and analysis. Building integrated, scalable, and robust computational workflows for such applications is challenging but highly valuable. Scientific data integration and pipelining facilitate standardized data processing, collaboration, and reuse of best practices. We describe how Jenkins-CI, an "off-the-shelf," open-source, continuous integration system, is used to build pipelines for processing images and associated data from high-content screening (HCS). Jenkins-CI provides numerous plugins for standard compute tasks, and its design allows the quick integration of external scientific applications. Using Jenkins-CI, we integrated CellProfiler, an open-source image-processing platform, with various HCS utilities and a high-performance Linux cluster. The platform is web-accessible, facilitates access and sharing of high-performance compute resources, and automates previously cumbersome data and image-processing tasks. Imaging pipelines developed using the desktop CellProfiler client can be managed and shared through a centralized Jenkins-CI repository. Pipelines and managed data are annotated to facilitate collaboration and reuse. Limitations with Jenkins-CI (primarily around the user interface) were addressed through the selection of helper plugins from the Jenkins-CI community.

SUBMITTER: Moutsatsos IK 

PROVIDER: S-EPMC5322829 | biostudies-other | 2017 Mar

REPOSITORIES: biostudies-other

altmetric image

Publications

Jenkins-CI, an Open-Source Continuous Integration System, as a Scientific Data and Image-Processing Platform.

Moutsatsos Ioannis K IK   Hossain Imtiaz I   Agarinis Claudia C   Harbinski Fred F   Abraham Yann Y   Dobler Luc L   Zhang Xian X   Wilson Christopher J CJ   Jenkins Jeremy L JL   Holway Nicholas N   Tallarico John J   Parker Christian N CN  

SLAS discovery : advancing life sciences R & D 20161213 3


High-throughput screening generates large volumes of heterogeneous data that require a diverse set of computational tools for management, processing, and analysis. Building integrated, scalable, and robust computational workflows for such applications is challenging but highly valuable. Scientific data integration and pipelining facilitate standardized data processing, collaboration, and reuse of best practices. We describe how Jenkins-CI, an "off-the-shelf," open-source, continuous integration  ...[more]

Similar Datasets

| S-EPMC4892737 | biostudies-literature
| S-EPMC3855844 | biostudies-literature
| S-EPMC6087497 | biostudies-literature
| S-EPMC7642381 | biostudies-literature
| S-EPMC7469624 | biostudies-literature
| S-EPMC8978263 | biostudies-literature
| S-EPMC6031065 | biostudies-literature
| S-EPMC4891379 | biostudies-other
| S-EPMC7815964 | biostudies-literature
| S-EPMC7490629 | biostudies-literature