Dataset Information

DigestiFlow: from BCL to FASTQ with ease


ABSTRACT:

Summary

Management of raw sequencing data and its pre-processing (conversion into sequences and demultiplexing) remains a challenging topic for groups running sequencing devices. Existing solutions range from manual management of spreadsheets to very complex, customized laboratory information management systems that handle much more than just raw sequencing data. In this article, we describe the software package DigestiFlow, which focuses on the management of Illumina flow cell sample sheets and raw data. It allows for automated extraction of information from flow cell data and management of sample sheets. Furthermore, it allows for the automated and reproducible conversion of Illumina base calls to sequences and their demultiplexing using bcl2fastq and Picard Tools, followed by quality control report generation.
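
As a rough illustration of the conversion step mentioned above, the following Python sketch assembles a bcl2fastq command line for one flow cell. It is not DigestiFlow's own code. The function name build_bcl2fastq_command, the example paths, and the default mismatch count are hypothetical and chosen only for illustration. The options used (--runfolder-dir, --output-dir, --sample-sheet, --barcode-mismatches) belong to the bcl2fastq command-line interface.

```python
from pathlib import Path
from typing import List


def build_bcl2fastq_command(
    run_folder: Path,
    output_dir: Path,
    sample_sheet: Path,
    barcode_mismatches: int = 1,
) -> List[str]:
    """Return the argument list for one bcl2fastq conversion run.

    Hypothetical helper for illustration; it only builds the command
    and does not execute it.
    """
    if barcode_mismatches < 0:
        raise ValueError("barcode_mismatches must be non-negative")
    return [
        "bcl2fastq",
        "--runfolder-dir", str(run_folder),      # Illumina run folder holding the base calls
        "--output-dir", str(output_dir),         # destination for the FASTQ files
        "--sample-sheet", str(sample_sheet),     # sample sheet used for demultiplexing
        "--barcode-mismatches", str(barcode_mismatches),
    ]


if __name__ == "__main__":
    # Example paths are assumptions, not locations used by DigestiFlow.
    command = build_bcl2fastq_command(
        Path("runs/EXAMPLE_FLOWCELL"),
        Path("fastq/EXAMPLE_FLOWCELL"),
        Path("runs/EXAMPLE_FLOWCELL/SampleSheet.csv"),
    )
    print(" ".join(command))
```

Building the command as a list keeps it reproducible and easy to log. On a machine with bcl2fastq installed, it could be run with subprocess.run(command, check=True).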

Availability and implementation

The software is available under the MIT license at https://github.com/bihealth/digestiflow-server. The client software components are available via Bioconda.

Supplementary information

Supplementary data are available at Bioinformatics online.

SUBMITTER: Holtgrewe M 

PROVIDER: S-EPMC7703778 | biostudies-literature | 2019 Nov

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC2847217 | biostudies-literature
| S-EPMC6547476 | biostudies-literature
| S-EPMC3606433 | biostudies-literature
| S-EPMC4459677 | biostudies-literature
2024-05-02 | E-MTAB-13687 | biostudies-arrayexpress
| S-EPMC8287537 | biostudies-literature
| PRJNA928832 | ENA
| PRJNA928833 | ENA
| PRJEB66260 | ENA
| S-EPMC4908325 | biostudies-literature