Unknown

Dataset Information

0

Alfred: interactive multi-sample BAM alignment statistics, feature counting and feature annotation for long- and short-read sequencing.


ABSTRACT: SUMMARY:Harmonizing quality control (QC) of large-scale second and third-generation sequencing datasets is key for enabling downstream computational and biological analyses. We present Alfred, an efficient and versatile command-line application that computes multi-sample QC metrics in a read-group aware manner, across a wide variety of sequencing assays and technologies. In addition to standard QC metrics such as GC bias, base composition, insert size and sequencing coverage distributions it supports haplotype-aware and allele-specific feature counting and feature annotation. The versatility of Alfred allows for easy pipeline integration in high-throughput settings, including DNA sequencing facilities and large-scale research initiatives, enabling continuous monitoring of sequence data quality and characteristics across samples. Alfred supports haplo-tagging of BAM/CRAM files to conduct haplotype-resolved analyses in conjunction with a variety of next-generation sequencing based assays. Alfred's companion web application enables interactive exploration of results and comparison to public datasets. AVAILABILITY AND IMPLEMENTATION:Alfred is open-source and freely available at https://tobiasrausch.com/alfred/. SUPPLEMENTARY INFORMATION:Supplementary data are available at Bioinformatics online.

SUBMITTER: Rausch T 

PROVIDER: S-EPMC6612896 | biostudies-literature | 2019 Jul

REPOSITORIES: biostudies-literature

altmetric image

Publications

Alfred: interactive multi-sample BAM alignment statistics, feature counting and feature annotation for long- and short-read sequencing.

Rausch Tobias T   Hsi-Yang Fritz Markus M   Korbel Jan O JO   Benes Vladimir V  

Bioinformatics (Oxford, England) 20190701 14


<h4>Summary</h4>Harmonizing quality control (QC) of large-scale second and third-generation sequencing datasets is key for enabling downstream computational and biological analyses. We present Alfred, an efficient and versatile command-line application that computes multi-sample QC metrics in a read-group aware manner, across a wide variety of sequencing assays and technologies. In addition to standard QC metrics such as GC bias, base composition, insert size and sequencing coverage distribution  ...[more]

Similar Datasets

| S-EPMC5324271 | biostudies-literature
2013-07-15 | E-MTAB-1728 | biostudies-arrayexpress
| S-EPMC2700917 | biostudies-literature
| S-EPMC4075596 | biostudies-literature
| S-EPMC3464235 | biostudies-literature
| S-EPMC5799205 | biostudies-literature
| S-EPMC4015027 | biostudies-other
| S-EPMC4253828 | biostudies-literature
| S-EPMC3027120 | biostudies-literature
| PRJEB4265 | ENA