Unknown

Dataset Information

0

Detection of Somatic Mutations in Exome Sequencing of Tumor-only Samples.


ABSTRACT: Due to lack of normal samples in clinical diagnosis and to reduce costs, detection of small-scale mutations from tumor-only samples is required but remains relatively unexplored. We developed an algorithm (GATKcan) augmenting GATK with two statistics and machine learning to detect mutations in cancer. The averaged performance of GATKcan in ten experiments outperformed GATK in detecting mutations of randomly sampled 231 from 241 TCGA endometrial tumors (EC). In external validations, GATKcan outperformed GATK in TCGA breast cancer (BC), ovarian cancer (OC) and melanoma tumors, in terms of Matthews correlation coefficient (MCC) and precision, where MCC takes both sensitivity and specificity into account. Further, GATKcan reduced high fractions of false positives detected by GATK. In mutation detection of somatic variants, classified commonly by VarScan 2 and MuTect from the called variants in BC, OC and melanoma, ranked by adjusted MCC (adjusted precision) GATKcan was the top 1, followed by MuTect, VarScan 2 and GATK. Importantly, GATKcan enables detection of mutations when alternate alleles exist in normal samples. These results suggest that GATKcan trained by a cancer is able to detect mutations in future patients with the same type of cancer and is likely applicable to other cancers with similar mutations.

SUBMITTER: Hsu YC 

PROVIDER: S-EPMC5698426 | biostudies-literature | 2017 Nov

REPOSITORIES: biostudies-literature

altmetric image

Publications

Detection of Somatic Mutations in Exome Sequencing of Tumor-only Samples.

Hsu Yu-Chin YC   Hsiao Yu-Ting YT   Kao Tzu-Yuan TY   Chang Jan-Gowth JG   Shieh Grace S GS  

Scientific reports 20171121 1


Due to lack of normal samples in clinical diagnosis and to reduce costs, detection of small-scale mutations from tumor-only samples is required but remains relatively unexplored. We developed an algorithm (GATKcan) augmenting GATK with two statistics and machine learning to detect mutations in cancer. The averaged performance of GATKcan in ten experiments outperformed GATK in detecting mutations of randomly sampled 231 from 241 TCGA endometrial tumors (EC). In external validations, GATKcan outpe  ...[more]

Similar Datasets

| S-EPMC3413789 | biostudies-literature
| S-EPMC4803342 | biostudies-literature
| S-EPMC4990858 | biostudies-literature
| S-EPMC10350007 | biostudies-literature
| S-EPMC3432702 | biostudies-literature
| S-EPMC6565602 | biostudies-literature
| S-EPMC4302757 | biostudies-literature
| S-EPMC5627484 | biostudies-literature
| S-EPMC5584341 | biostudies-literature
| S-EPMC4073211 | biostudies-literature