Unknown

Dataset Information

0

System for Quality-Assured Data Analysis: Flexible, reproducible scientific workflows.


ABSTRACT: The reproducibility of scientific processes is one of the paramount problems of bioinformatics, an engineering problem that must be addressed to perform good research. The System for Quality-Assured Data Analysis (SyQADA), described here, seeks to address reproducibility by managing many of the details of procedural bookkeeping in bioinformatics in as simple and transparent a manner as possible. SyQADA has been used by persons with backgrounds ranging from expert programmer to Unix novice, to perform and repeat dozens of diverse bioinformatics workflows on tens of thousands of samples, consuming over 80 CPU-months of computing on over 300,000 individual tasks of scores of projects on laptops, computer servers, and computing clusters. SyQADA is especially well-suited for paired-sample analyses found in cancer tumor-normal studies. SyQADA executable source code, documentation, tutorial examples, and workflows used in our lab is available from http://scheet.org/software.html.

SUBMITTER: Fowler J 

PROVIDER: S-EPMC6571143 | biostudies-literature | 2019 Mar

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC8637247 | biostudies-literature
| S-EPMC10436054 | biostudies-literature
| S-EPMC5501156 | biostudies-literature
| S-EPMC10055538 | biostudies-literature
| S-EPMC7479590 | biostudies-literature
| S-EPMC7431652 | biostudies-literature
| S-EPMC8514239 | biostudies-literature
| S-EPMC6223375 | biostudies-literature
| S-EPMC7403855 | biostudies-literature
| S-EPMC3167055 | biostudies-literature