Dataset Information

CoVaCS: a consensus variant calling system.

ABSTRACT:

Background

The advent and ongoing development of next-generation sequencing (NGS) technologies have led to a rapid increase in the rate at which human genome resequencing data are produced, paving the way for personalized genomics and precision medicine. This growing body of data underlines the need for accurate and time-effective bioinformatics systems for genotyping, a crucial prerequisite for identifying candidate causal mutations in diagnostic screens.

Results

Here we present CoVaCS, a fully automated, highly accurate system with a web-based graphical interface for genotyping and variant annotation. Extensive tests on a gold-standard benchmark data set (the NA12878 Illumina platinum genome) confirm that call sets based on our consensus strategy are fully in line with those obtained by similar command-line approaches, and far more accurate than call sets from any individual tool. Importantly, our system exhibits better sensitivity and higher specificity than equivalent commercial software.
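To illustrate the general idea of consensus calling and benchmark comparison mentioned in the Results, the following is a minimal sketch only, not the CoVaCS implementation. It assumes each caller's output has already been reduced to a set of (chromosome, position, ref, alt) tuples, and that a variant is kept when at least two callers report it. The caller names, the `min_support` threshold, and the toy variants are all hypothetical. Precision is reported rather than specificity, since specificity would also require counting true negative sites.

```python
from collections import Counter
from typing import Dict, Iterable, Set, Tuple

# A variant is keyed by (chromosome, 1-based position, ref allele, alt allele).
Variant = Tuple[str, int, str, str]


def consensus_calls(call_sets: Dict[str, Iterable[Variant]],
                    min_support: int = 2) -> Set[Variant]:
    """Return variants reported by at least `min_support` callers.

    `min_support` is an illustrative parameter, not a documented CoVaCS setting.
    """
    support: Counter = Counter()
    for calls in call_sets.values():
        # Deduplicate so a caller listing a variant twice counts only once.
        support.update(set(calls))
    return {variant for variant, n in support.items() if n >= min_support}


def sensitivity_precision(calls: Set[Variant],
                          truth: Set[Variant]) -> Tuple[float, float]:
    """Compare a call set with a truth set (e.g. a benchmark genome)."""
    true_positives = len(calls & truth)
    sensitivity = true_positives / len(truth) if truth else 0.0
    precision = true_positives / len(calls) if calls else 0.0
    return sensitivity, precision


# Hypothetical toy data: three callers and a small truth set.
v1 = ("chr1", 1000, "A", "G")
v2 = ("chr1", 2000, "C", "T")
v3 = ("chr2", 3000, "G", "A")
v4 = ("chr2", 4000, "T", "C")
v5 = ("chr3", 5000, "A", "T")

call_sets = {
    "caller_a": [v1, v2, v3],
    "caller_b": [v1, v2],
    "caller_c": [v1, v4],
}
truth = {v1, v2, v5}

consensus = consensus_calls(call_sets, min_support=2)
sens, prec = sensitivity_precision(consensus, truth)
print(sorted(consensus))
print(f"sensitivity={sens:.3f} precision={prec:.3f}")
```

In this toy case the variants seen by only one caller (`v3`, `v4`) are dropped, which is how a consensus approach trades a little sensitivity for fewer caller-specific false positives.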

Conclusions

CoVaCS offers optimized pipelines integrating state-of-the-art tools for variant calling and annotation of whole-genome sequencing (WGS), whole-exome sequencing (WES) and target-gene sequencing (TGS) data. The system is currently hosted at Cineca and offers the speed of an HPC facility, a crucial consideration when large numbers of samples must be analysed. Importantly, all analyses are performed automatically, which makes the results highly reproducible. As such, we believe that CoVaCS can be a valuable tool for the analysis of human genome resequencing studies. CoVaCS is available at: https://bioinformatics.cineca.it/covacs

SUBMITTER: Chiara M

PROVIDER: S-EPMC5800023 | biostudies-literature | 2018 Feb

REPOSITORIES: biostudies-literature

Publications

CoVaCS: a consensus variant calling system.

Chiara M, Gioiosa S, Chillemi G, D'Antonio M, Flati T, Picardi E, Zambelli F, Horner DS, Pesole G, Castrignanò T

BMC Genomics, 2018 Feb 5, issue 1

PMID: 29402227

Similar Datasets


SomaticCombiner: improving the performance of somatic variant calling based on evaluation tests and a consensus approach. | S-EPMC7393490 | biostudies-literature

Multithreaded variant calling in elPrep 5. | S-EPMC7861424 | biostudies-literature

Parliament2: Accurate structural variant calling at scale. | S-EPMC7751401 | biostudies-literature

ToTem: a tool for variant calling pipeline optimization. | S-EPMC6020218 | biostudies-literature

HaploTypo: a variant-calling pipeline for phased genomes. | S-EPMC7178392 | biostudies-literature

Best practices for variant calling in clinical sequencing. | S-EPMC7586657 | biostudies-literature

Reliable variant calling during runtime of Illumina sequencing. | S-EPMC6848508 | biostudies-literature