
Dataset Information


aRNApipe: a balanced, efficient and distributed pipeline for processing RNA-seq data in high-performance computing environments.


ABSTRACT:

Summary

The wide range of RNA-seq applications and their high computational needs require the development of pipelines orchestrating the entire workflow and optimizing usage of available computational resources. We present aRNApipe, a project-oriented pipeline for processing of RNA-seq data in high-performance cluster environments. aRNApipe is highly modular and can be easily migrated to any high-performance computing (HPC) environment. The current applications included in aRNApipe combine the essential RNA-seq primary analyses, including quality control metrics, transcript alignment, count generation, transcript fusion identification, alternative splicing and sequence variant calling. aRNApipe is project-oriented and dynamic, so users can easily update analyses to include or exclude samples or enable additional processing modules. Workflow parameters are easily set using a single configuration file that provides centralized tracking of all analytical processes. Finally, aRNApipe incorporates interactive web reports for sample tracking and a tool for managing the genome assemblies available to perform an analysis.

Availability and documentation

https://github.com/HudsonAlpha/aRNAPipe ; DOI: 10.5281/zenodo.202950.

Contact

rmyers@hudsonalpha.org.

Supplementary information

Supplementary data are available at Bioinformatics online.

SUBMITTER: Alonso A 

PROVIDER: S-EPMC5447234 | biostudies-literature | 2017 Jun

REPOSITORIES: biostudies-literature


Publications

aRNApipe: a balanced, efficient and distributed pipeline for processing RNA-seq data in high-performance computing environments.

Alonso A, Lasseigne BN, Williams K, Nielsen J, Ramaker RC, Hardigan AA, Johnston B, Roberts BS, Cooper SJ, Marsal S, Myers RM

Bioinformatics (Oxford, England), 2017 Jun 1, issue 11



Similar Datasets

| S-EPMC8044432 | biostudies-literature
| S-EPMC3051320 | biostudies-literature
| S-EPMC5897949 | biostudies-literature
| S-EPMC7579964 | biostudies-literature
| S-EPMC4907397 | biostudies-literature
| S-EPMC5155159 | biostudies-other
| S-EPMC4918025 | biostudies-other
| S-EPMC7525341 | biostudies-literature
| S-EPMC4179863 | biostudies-other
| S-EPMC3467745 | biostudies-literature