Unknown

Dataset Information

0

Genozip: a fast and efficient compression tool for VCF files.


ABSTRACT:

Motivation

genozip is a new lossless compression tool for Variant Call Format (VCF) files. By applying field-specific algorithms and fully utilizing the available computational hardware, genozip achieves the highest compression ratios amongst existing lossless compression tools known to the authors, at speeds comparable with the fastest multi-threaded compressors.

Availability and implementation

genozip is freely available to non-commercial users. It can be installed via conda-forge, Docker Hub, or downloaded from github.com/divonlan/genozip.

Supplementary information

Supplementary data are available at Bioinformatics online.

SUBMITTER: Lan D 

PROVIDER: S-EPMC7332572 | biostudies-literature | 2020 Jul

REPOSITORIES: biostudies-literature

altmetric image

Publications

genozip: a fast and efficient compression tool for VCF files.

Lan Divon D   Tobler Raymond R   Souilmi Yassine Y   Llamas Bastien B  

Bioinformatics (Oxford, England) 20200701 13


<h4>Motivation</h4>genozip is a new lossless compression tool for Variant Call Format (VCF) files. By applying field-specific algorithms and fully utilizing the available computational hardware, genozip achieves the highest compression ratios amongst existing lossless compression tools known to the authors, at speeds comparable with the fastest multi-threaded compressors.<h4>Availability and implementation</h4>genozip is freely available to non-commercial users. It can be installed via conda-for  ...[more]

Similar Datasets

| S-EPMC11258903 | biostudies-literature
| S-EPMC7116594 | biostudies-literature
| S-EPMC4376647 | biostudies-literature
| S-EPMC4793895 | biostudies-literature
| S-EPMC7487613 | biostudies-literature
| S-EPMC9825764 | biostudies-literature
| S-EPMC6007233 | biostudies-literature
| S-EPMC6078172 | biostudies-literature
| S-EPMC9218589 | biostudies-literature
| S-EPMC11226158 | biostudies-literature