Unknown

Dataset Information

0

In-depth comparison of somatic point mutation callers based on different tumor next-generation sequencing depth data.


ABSTRACT: Four popular somatic single nucleotide variant (SNV) calling methods (Varscan, SomaticSniper, Strelka and MuTect2) were carefully evaluated on the real whole exome sequencing (WES, depth of ~50X) and ultra-deep targeted sequencing (UDT-Seq, depth of ~370X) data. The four tools returned poor consensus on candidates (only 20% of calls were with multiple hits by the callers). For both WES and UDT-Seq, MuTect2 and Strelka obtained the largest proportion of COSMIC entries as well as the lowest rate of dbSNP presence and high-alternative-alleles-in-control calls, demonstrating their superior sensitivity and accuracy. Combining different callers does increase reliability of candidates, but narrows the list down to very limited range of tumor read depth and variant allele frequency. Calling SNV on UDT-Seq data, which were of much higher read-depth, discovered additional true-positive variations, despite an even more tremendous growth in false positive predictions. Our findings not only provide valuable benchmark for state-of-the-art SNV calling methods, but also shed light on the access to more accurate SNV identification in the future.

SUBMITTER: Cai L 

PROVIDER: S-EPMC5118795 | biostudies-literature | 2016 Nov

REPOSITORIES: biostudies-literature

altmetric image

Publications

In-depth comparison of somatic point mutation callers based on different tumor next-generation sequencing depth data.

Cai Lei L   Yuan Wei W   Zhang Zhou Z   He Lin L   Chou Kuo-Chen KC  

Scientific reports 20161122


Four popular somatic single nucleotide variant (SNV) calling methods (Varscan, SomaticSniper, Strelka and MuTect2) were carefully evaluated on the real whole exome sequencing (WES, depth of ~50X) and ultra-deep targeted sequencing (UDT-Seq, depth of ~370X) data. The four tools returned poor consensus on candidates (only 20% of calls were with multiple hits by the callers). For both WES and UDT-Seq, MuTect2 and Strelka obtained the largest proportion of COSMIC entries as well as the lowest rate o  ...[more]

Similar Datasets

| S-EPMC3971343 | biostudies-literature
| S-EPMC3785481 | biostudies-literature
| S-EPMC7044309 | biostudies-literature
| S-EPMC7293574 | biostudies-literature
| 41336 | ecrin-mdr-crc
| S-EPMC4920415 | biostudies-literature
| S-EPMC6176345 | biostudies-literature
| S-EPMC3702398 | biostudies-literature
| S-EPMC4035752 | biostudies-literature
| S-EPMC3604800 | biostudies-literature