Unknown

Dataset Information

0

Intersect-then-combine approach: improving the performance of somatic variant calling in whole exome sequencing data using multiple aligners and callers.


ABSTRACT: Bioinformatic analysis of genomic sequencing data to identify somatic mutations in cancer samples is far from achieving the required robustness and standardisation. In this study we generated a whole exome sequencing benchmark dataset using the platinum genome sample NA12878 and developed an intersect-then-combine (ITC) approach to increase the accuracy in calling single nucleotide variants (SNVs) and indels in tumour-normal pairs. We evaluated the effect of alignment, base quality recalibration, mutation caller and filtering on sensitivity and false positive rate. The ITC approach increased the sensitivity up to 17.1%, without increasing the false positive rate per megabase (FPR/Mb) and its validity was confirmed in a set of clinical samples.

SUBMITTER: Callari M 

PROVIDER: S-EPMC5394620 | biostudies-literature | 2017 Apr

REPOSITORIES: biostudies-literature

altmetric image

Publications

Intersect-then-combine approach: improving the performance of somatic variant calling in whole exome sequencing data using multiple aligners and callers.

Callari Maurizio M   Sammut Stephen-John SJ   De Mattos-Arruda Leticia L   Bruna Alejandra A   Rueda Oscar M OM   Chin Suet-Feung SF   Caldas Carlos C  

Genome medicine 20170418 1


Bioinformatic analysis of genomic sequencing data to identify somatic mutations in cancer samples is far from achieving the required robustness and standardisation. In this study we generated a whole exome sequencing benchmark dataset using the platinum genome sample NA12878 and developed an intersect-then-combine (ITC) approach to increase the accuracy in calling single nucleotide variants (SNVs) and indels in tumour-normal pairs. We evaluated the effect of alignment, base quality recalibration  ...[more]

Similar Datasets

| S-EPMC8509018 | biostudies-literature
| S-EPMC4803342 | biostudies-literature
| S-EPMC3986649 | biostudies-literature
| S-EPMC10350007 | biostudies-literature
| S-EPMC4137624 | biostudies-literature
| S-EPMC7604644 | biostudies-literature
| S-EPMC5657037 | biostudies-literature
| S-EPMC8756192 | biostudies-literature
| S-EPMC6580603 | biostudies-literature
| S-EPMC7393490 | biostudies-literature