Unknown

Dataset Information

0

Bazam: a rapid method for read extraction and realignment of high-throughput sequencing data.


ABSTRACT: The vast quantities of short-read sequencing data being generated are often exchanged and stored as aligned reads. However, aligned data becomes outdated as new reference genomes and alignment methods become available. Here we describe Bazam, a tool that efficiently extracts the original paired FASTQ from alignment files (BAM or CRAM format) in a format that directly allows efficient realignment. Bazam facilitates up to a 90% reduction in the time for realignment compared to standard methods. Bazam can support selective extraction of read pairs from focused genomic regions for applications such as targeted region analyses, quality control, structural variant calling, and alignment comparisons.

SUBMITTER: Sadedin SP 

PROVIDER: S-EPMC6472072 | biostudies-literature | 2019 Apr

REPOSITORIES: biostudies-literature

altmetric image

Publications

Bazam: a rapid method for read extraction and realignment of high-throughput sequencing data.

Sadedin Simon P SP   Oshlack Alicia A  

Genome biology 20190418 1


The vast quantities of short-read sequencing data being generated are often exchanged and stored as aligned reads. However, aligned data becomes outdated as new reference genomes and alignment methods become available. Here we describe Bazam, a tool that efficiently extracts the original paired FASTQ from alignment files (BAM or CRAM format) in a format that directly allows efficient realignment. Bazam facilitates up to a 90% reduction in the time for realignment compared to standard methods. Ba  ...[more]

Similar Datasets

| S-EPMC6735924 | biostudies-literature
| S-EPMC2532726 | biostudies-literature
| S-EPMC4429651 | biostudies-literature
| S-EPMC4301848 | biostudies-literature
| S-EPMC8360517 | biostudies-literature
| S-EPMC3416827 | biostudies-literature
| S-EPMC5547002 | biostudies-literature
| S-EPMC6114050 | biostudies-literature
| S-EPMC5622927 | biostudies-literature
| S-EPMC3805581 | biostudies-literature