Unknown

Dataset Information

0

VariantBam: filtering and profiling of next-generational sequencing data using region-specific rules.


ABSTRACT: We developed VariantBam, a C?++?read filtering and profiling tool for use with BAM, CRAM and SAM sequencing files. VariantBam provides a flexible framework for extracting sequencing reads or read-pairs that satisfy combinations of rules, defined by any number of genomic intervals or variant sites. We have implemented filters based on alignment data, sequence motifs, regional coverage and base quality. For example, VariantBam achieved a median size reduction ratio of 3.1:1 when applied to 10 lung cancer whole genome BAMs by removing large tags and selecting for only high-quality variant-supporting reads and reads matching a large dictionary of sequence motifs. Thus VariantBam enables efficient storage of sequencing data while preserving the most relevant information for downstream analysis.VariantBam and full documentation are available at github.com/jwalabroad/VariantBamrameen@broadinstitute.orgSupplementary data are available at Bioinformatics online.

SUBMITTER: Wala J 

PROVIDER: S-EPMC4920121 | biostudies-literature | 2016 Jul

REPOSITORIES: biostudies-literature

altmetric image

Publications

VariantBam: filtering and profiling of next-generational sequencing data using region-specific rules.

Wala Jeremiah J   Zhang Cheng-Zhong CZ   Meyerson Matthew M   Beroukhim Rameen R  

Bioinformatics (Oxford, England) 20160226 13


<h4>Unlabelled</h4>We developed VariantBam, a C ++ read filtering and profiling tool for use with BAM, CRAM and SAM sequencing files. VariantBam provides a flexible framework for extracting sequencing reads or read-pairs that satisfy combinations of rules, defined by any number of genomic intervals or variant sites. We have implemented filters based on alignment data, sequence motifs, regional coverage and base quality. For example, VariantBam achieved a median size reduction ratio of 3.1:1 when  ...[more]

Similar Datasets

| S-EPMC6485405 | biostudies-literature
| S-EPMC3371040 | biostudies-literature
| S-EPMC4476701 | biostudies-literature
| S-EPMC5427585 | biostudies-literature
| S-EPMC9792947 | biostudies-literature
| S-EPMC5583360 | biostudies-literature
| S-EPMC7282053 | biostudies-literature
| S-EPMC4344483 | biostudies-literature
| S-EPMC5037392 | biostudies-literature
| S-EPMC5374681 | biostudies-literature