Unknown

Dataset Information

0

BCFtools/csq: haplotype-aware variant consequences.


ABSTRACT:

Motivation

Prediction of functional variant consequences is an important part of sequencing pipelines, allowing the categorization and prioritization of genetic variants for follow up analysis. However, current predictors analyze variants as isolated events, which can lead to incorrect predictions when adjacent variants alter the same codon, or when a frame-shifting indel is followed by a frame-restoring indel. Exploiting known haplotype information when making consequence predictions can resolve these issues.

Results

BCFtools/csq is a fast program for haplotype-aware consequence calling which can take into account known phase. Consequence predictions are changed for 501 of 5019 compound variants found in the 81.7M variants in the 1000 Genomes Project data, with an average of 139 compound variants per haplotype. Predictions match existing tools when run in localized mode, but the program is an order of magnitude faster and requires an order of magnitude less memory.

Availability and implementation

The program is freely available for commercial and non-commercial use in the BCFtools package which is available for download from http://samtools.github.io/bcftools .

Contact

pd3@sanger.ac.uk.

Supplementary information

Supplementary data are available at Bioinformatics online.

SUBMITTER: Danecek P 

PROVIDER: S-EPMC5870570 | biostudies-literature | 2017 Jul

REPOSITORIES: biostudies-literature

altmetric image

Publications

BCFtools/csq: haplotype-aware variant consequences.

Danecek Petr P   McCarthy Shane A SA  

Bioinformatics (Oxford, England) 20170701 13


<h4>Motivation</h4>Prediction of functional variant consequences is an important part of sequencing pipelines, allowing the categorization and prioritization of genetic variants for follow up analysis. However, current predictors analyze variants as isolated events, which can lead to incorrect predictions when adjacent variants alter the same codon, or when a frame-shifting indel is followed by a frame-restoring indel. Exploiting known haplotype information when making consequence predictions ca  ...[more]

Similar Datasets

| S-EPMC8519448 | biostudies-literature
| S-EPMC7223266 | biostudies-literature
| S-EPMC8571015 | biostudies-literature
| S-EPMC6547545 | biostudies-literature
| S-EPMC7066762 | biostudies-literature
| S-EPMC10612404 | biostudies-literature
| S-EPMC10274712 | biostudies-literature
| S-EPMC8092372 | biostudies-literature
| S-EPMC9022890 | biostudies-literature
| S-EPMC8549298 | biostudies-literature