Ontology highlight
ABSTRACT:
SUBMITTER: Faust GG
PROVIDER: S-EPMC4147885 | biostudies-literature | 2014 Sep
REPOSITORIES: biostudies-literature
Faust Gregory G GG Hall Ira M IM
Bioinformatics (Oxford, England) 20140507 17
<h4>Motivation</h4>Illumina DNA sequencing is now the predominant source of raw genomic data, and data volumes are growing rapidly. Bioinformatic analysis pipelines are having trouble keeping pace. A common bottleneck in such pipelines is the requirement to read, write, sort and compress large BAM files multiple times.<h4>Results</h4>We present SAMBLASTER, a tool that reduces the number of times such costly operations are performed. SAMBLASTER is designed to mark duplicates in read-sorted SAM fi ...[more]