
Dataset Information


FIREVAT: finding reliable variants without artifacts in human cancer samples using etiologically relevant mutational signatures.


ABSTRACT:

BACKGROUND: Accurate identification of real somatic variants is a primary part of cancer genome studies and precision oncology. However, artifacts introduced at various steps of sequencing undermine confidence in variant calling. Current computational approaches to variant filtering involve intensive interrogation of Binary Alignment Map (BAM) files and require massive computing power, data storage, and manual labor. Recently, mutational signatures associated with sequencing artifacts have been extracted by the Pan-cancer Analysis of Whole Genomes (PCAWG) study. These spectra can be used to evaluate the refinement quality of a given set of somatic mutations.

RESULTS: Here we introduce a novel variant refinement software, FIREVAT (FInding REliable Variants without ArTifacts), which uses known spectra of sequencing artifacts extracted from one of the largest publicly available catalogs of human tumor samples. FIREVAT performs quick and efficient variant refinement that accurately removes artifacts and greatly improves the precision and specificity of somatic calls. We validated FIREVAT refinement performance against ground truth using orthogonal sequencing datasets totaling 384 tumor samples. Our novel method achieved the highest level of performance compared to existing filtering approaches. Application of FIREVAT to an additional 308 samples from The Cancer Genome Atlas (TCGA) demonstrated that FIREVAT refinement leads to identification of more biologically and clinically relevant mutational signatures as well as enrichment of sequence contexts associated with experimental errors. FIREVAT requires only a Variant Call Format (VCF) file and generates a comprehensive report of the variant refinement processes and outcomes for the user.

CONCLUSIONS: In summary, FIREVAT facilitates a novel refinement strategy using mutational signatures to distinguish artifactual point mutations called in human cancer samples. We anticipate that FIREVAT results will further contribute to precision oncology efforts that rely on accurate identification of variants, especially in the context of analyzing mutational signatures that bear prognostic and therapeutic significance. FIREVAT is freely available at https://github.com/cgab-ncc/FIREVAT.
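
The idea that artifact-associated signatures can be used to score a set of somatic calls can be illustrated with a small sketch. The code below is not FIREVAT's implementation and does not use its API. It is a toy, assuming a 4-channel spectrum instead of the usual 96 trinucleotide contexts, two made-up signatures (the names `biological_sig` and `artifact_sig` are hypothetical), and a plain non-negative least-squares fit by multiplicative updates. It estimates what fraction of the observed mutations is attributed to the artifact signature; a lower fraction after filtering would suggest a cleaner call set.

```python
def fit_signature_weights(counts, signatures, iterations=200):
    """Fit non-negative weights so that sum(weight * signature) ~ counts.

    Uses multiplicative updates for non-negative least squares.
    `signatures` maps a name to a list of channel probabilities.
    """
    names = list(signatures)
    weights = {name: 1.0 for name in names}
    for _ in range(iterations):
        # Reconstruct the spectrum from the current weights.
        recon = [
            sum(weights[n] * signatures[n][i] for n in names)
            for i in range(len(counts))
        ]
        for n in names:
            num = sum(s * c for s, c in zip(signatures[n], counts))
            den = sum(s * r for s, r in zip(signatures[n], recon))
            if den > 0:
                weights[n] *= num / den
    return weights


def artifact_fraction(weights, artifact_names):
    """Share of the total fitted weight assigned to artifact signatures."""
    total = sum(weights.values())
    if total == 0:
        return 0.0
    return sum(weights[n] for n in artifact_names) / total


# Hypothetical toy signatures (each sums to 1); real catalogs use 96 channels.
signatures = {
    "biological_sig": [0.7, 0.1, 0.1, 0.1],
    "artifact_sig": [0.1, 0.1, 0.1, 0.7],
}

# Toy mutation counts per channel for one unfiltered call set.
observed = [22, 4, 4, 10]

weights = fit_signature_weights(observed, signatures)
fraction = artifact_fraction(weights, ["artifact_sig"])
print(f"Estimated artifact fraction: {fraction:.2f}")
```

Because each toy signature sums to 1, a fitted weight can be read as the number of mutations attributed to that signature. Comparing the artifact fraction before and after applying a filter gives one simple measure of refinement quality.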

SUBMITTER: Kim H 

PROVIDER: S-EPMC6916105 | biostudies-literature | 2019 Dec

REPOSITORIES: biostudies-literature


Publications

FIREVAT: finding reliable variants without artifacts in human cancer samples using etiologically relevant mutational signatures.

Kim H, Lee AJ, Lee J, Chun H, Ju YS, Hong D

Genome Medicine, 2019-12-17, issue 1



Similar Datasets

| S-EPMC5957269 | biostudies-literature
| S-EPMC6001047 | biostudies-literature
| S-EPMC4817139 | biostudies-literature
| S-EPMC8597642 | biostudies-literature
| PRJEB60585 | ENA
| S-EPMC9864147 | biostudies-literature
| S-EPMC3541986 | biostudies-literature
| S-EPMC9170691 | biostudies-literature
| S-EPMC3461587 | biostudies-literature
| S-EPMC10622322 | biostudies-literature