Unknown

Dataset Information

0

A machine learning approach for somatic mutation discovery.


ABSTRACT: Variability in the accuracy of somatic mutation detection may affect the discovery of alterations and the therapeutic management of cancer patients. To address this issue, we developed a somatic mutation discovery approach based on machine learning that outperformed existing methods in identifying experimentally validated tumor alterations (sensitivity of 97% versus 90 to 99%; positive predictive value of 98% versus 34 to 92%). Analysis of paired tumor-normal exome data from 1368 TCGA (The Cancer Genome Atlas) samples using this method revealed concordance for 74% of mutation calls but also identified likely false-positive and false-negative changes in TCGA data, including in clinically actionable genes. Determination of high-quality somatic mutation calls improved tumor mutation load-based predictions of clinical outcome for melanoma and lung cancer patients previously treated with immune checkpoint inhibitors. Integration of high-quality machine learning mutation detection in clinical next-generation sequencing (NGS) analyses increased the accuracy of test results compared to other clinical sequencing analyses. These analyses provide an approach for improved identification of tumor-specific mutations and have important implications for research and clinical management of cancer patients.

SUBMITTER: Wood DE 

PROVIDER: S-EPMC6481619 | biostudies-literature | 2018 Sep

REPOSITORIES: biostudies-literature

altmetric image

Publications


Variability in the accuracy of somatic mutation detection may affect the discovery of alterations and the therapeutic management of cancer patients. To address this issue, we developed a somatic mutation discovery approach based on machine learning that outperformed existing methods in identifying experimentally validated tumor alterations (sensitivity of 97% versus 90 to 99%; positive predictive value of 98% versus 34 to 92%). Analysis of paired tumor-normal exome data from 1368 TCGA (The Cance  ...[more]

Similar Datasets

2022-10-01 | GSE200096 | GEO
2020-09-01 | E-MTAB-9501 | biostudies-arrayexpress
| S-EPMC10227442 | biostudies-literature
| S-EPMC7372518 | biostudies-literature
| S-EPMC6241511 | biostudies-literature
| S-EPMC5905914 | biostudies-other
| S-EPMC6600179 | biostudies-literature
| S-EPMC10257182 | biostudies-literature
| S-EPMC6609671 | biostudies-other
| S-EPMC6137445 | biostudies-other