Unknown

Dataset Information

0

An empirical Bayesian framework for somatic mutation detection from cancer genome sequencing data.


ABSTRACT: Recent advances in high-throughput sequencing technologies have enabled a comprehensive dissection of the cancer genome clarifying a large number of somatic mutations in a wide variety of cancer types. A number of methods have been proposed for mutation calling based on a large amount of sequencing data, which is accomplished in most cases by statistically evaluating the difference in the observed allele frequencies of possible single nucleotide variants between tumours and paired normal samples. However, an accurate detection of mutations remains a challenge under low sequencing depths or tumour contents. To overcome this problem, we propose a novel method, Empirical Bayesian mutation Calling (https://github.com/friend1ws/EBCall), for detecting somatic mutations. Unlike previous methods, the proposed method discriminates somatic mutations from sequencing errors based on an empirical Bayesian framework, where the model parameters are estimated using sequencing data from multiple non-paired normal samples. Using 13 whole-exome sequencing data with 87.5-206.3 mean sequencing depths, we demonstrate that our method not only outperforms several existing methods in the calling of mutations with moderate allele frequencies but also enables accurate calling of mutations with low allele frequencies (? 10%) harboured within a minor tumour subpopulation, thus allowing for the deciphering of fine substructures within a tumour specimen.

SUBMITTER: Shiraishi Y 

PROVIDER: S-EPMC3627598 | biostudies-literature | 2013 Apr

REPOSITORIES: biostudies-literature

altmetric image

Publications

An empirical Bayesian framework for somatic mutation detection from cancer genome sequencing data.

Shiraishi Yuichi Y   Sato Yusuke Y   Chiba Kenichi K   Okuno Yusuke Y   Nagata Yasunobu Y   Yoshida Kenichi K   Shiba Norio N   Hayashi Yasuhide Y   Kume Haruki H   Homma Yukio Y   Sanada Masashi M   Ogawa Seishi S   Miyano Satoru S  

Nucleic acids research 20130306 7


Recent advances in high-throughput sequencing technologies have enabled a comprehensive dissection of the cancer genome clarifying a large number of somatic mutations in a wide variety of cancer types. A number of methods have been proposed for mutation calling based on a large amount of sequencing data, which is accomplished in most cases by statistically evaluating the difference in the observed allele frequencies of possible single nucleotide variants between tumours and paired normal samples  ...[more]

Similar Datasets

| S-EPMC4682041 | biostudies-literature
| S-EPMC3971343 | biostudies-literature
| S-EPMC3259434 | biostudies-literature
| S-EPMC9022462 | biostudies-literature
| S-EPMC3219132 | biostudies-literature
| S-EPMC6853710 | biostudies-literature
| S-EPMC8532138 | biostudies-literature
| S-EPMC10350007 | biostudies-literature
| S-EPMC5988673 | biostudies-literature
| S-EPMC9302205 | biostudies-literature