Unknown

Dataset Information

0

Performance assessment of variant calling pipelines using human whole exome sequencing and simulated data.


ABSTRACT: BACKGROUND:Whole exome sequencing (WES) is a cost-effective method that identifies clinical variants but it demands accurate variant caller tools. Currently available tools have variable accuracy in predicting specific clinical variants. But it may be possible to find the best combination of aligner-variant caller tools for detecting accurate single nucleotide variants (SNVs) and small insertion and deletion (InDels) separately. Moreover, many important aspects of InDel detection are overlooked while comparing the performance of tools, particularly its base pair length. RESULTS:We assessed the performance of variant calling pipelines using the combinations of four variant callers and five aligners on human NA12878 and simulated exome data. We used high confidence variant calls from Genome in a Bottle (GiaB) consortium for validation, and GRCh37 and GRCh38 as the human reference genome. Based on the performance metrics, both BWA and Novoalign aligners performed better with DeepVariant and SAMtools callers for detecting SNVs, and with DeepVariant and GATK for InDels. Furthermore, we obtained similar results on human NA24385 and NA24631 exome data from GiaB. CONCLUSION:In this study, DeepVariant with BWA and Novoalign performed best for detecting accurate SNVs and InDels. The accuracy of variant calling was improved by merging the top performing pipelines. The results of our study provide useful recommendations for analysis of WES data in clinical genomics.

SUBMITTER: Kumaran M 

PROVIDER: S-EPMC6580603 | biostudies-literature | 2019 Jun

REPOSITORIES: biostudies-literature

altmetric image

Publications

Performance assessment of variant calling pipelines using human whole exome sequencing and simulated data.

Kumaran Manojkumar M   Subramanian Umadevi U   Devarajan Bharanidharan B  

BMC bioinformatics 20190617 1


<h4>Background</h4>Whole exome sequencing (WES) is a cost-effective method that identifies clinical variants but it demands accurate variant caller tools. Currently available tools have variable accuracy in predicting specific clinical variants. But it may be possible to find the best combination of aligner-variant caller tools for detecting accurate single nucleotide variants (SNVs) and small insertion and deletion (InDels) separately. Moreover, many important aspects of InDel detection are ove  ...[more]

Similar Datasets

| S-EPMC8509018 | biostudies-literature
| S-EPMC7604644 | biostudies-literature
| S-EPMC4129436 | biostudies-literature
| S-EPMC3706896 | biostudies-literature
| S-EPMC5394620 | biostudies-literature
| S-EPMC4137624 | biostudies-literature
| S-EPMC4240813 | biostudies-literature
| S-EPMC4671096 | biostudies-literature
| S-EPMC6370902 | biostudies-other
| S-EPMC8141913 | biostudies-literature