Unknown

Dataset Information

0

Combined burden and functional impact tests for cancer driver discovery using DriverPower.


ABSTRACT: The discovery of driver mutations is one of the key motivations for cancer genome sequencing. Here, as part of the ICGC/TCGA Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium, which aggregated whole genome sequencing data from 2658 cancers across 38 tumour types, we describe DriverPower, a software package that uses mutational burden and functional impact evidence to identify driver mutations in coding and non-coding sites within cancer whole genomes. Using a total of 1373 genomic features derived from public sources, DriverPower's background mutation model explains up to 93% of the regional variance in the mutation rate across multiple tumour types. By incorporating functional impact scores, we are able to further increase the accuracy of driver discovery. Testing across a collection of 2583 cancer genomes from the PCAWG project, DriverPower identifies 217 coding and 95 non-coding driver candidates. Comparing to six published methods used by the PCAWG Drivers and Functional Interpretation Working Group, DriverPower has the highest F1 score for both coding and non-coding driver discovery. This demonstrates that DriverPower is an effective framework for computational driver discovery.

SUBMITTER: Shuai S 

PROVIDER: S-EPMC7002750 | biostudies-literature | 2020 Feb

REPOSITORIES: biostudies-literature

altmetric image

Publications

Combined burden and functional impact tests for cancer driver discovery using DriverPower.

Shuai Shimin S   Gallinger Steven S   Stein Lincoln D LD  

Nature communications 20200205 1


The discovery of driver mutations is one of the key motivations for cancer genome sequencing. Here, as part of the ICGC/TCGA Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium, which aggregated whole genome sequencing data from 2658 cancers across 38 tumour types, we describe DriverPower, a software package that uses mutational burden and functional impact evidence to identify driver mutations in coding and non-coding sites within cancer whole genomes. Using a total of 1373 genomic features  ...[more]

Similar Datasets

| S-EPMC7683798 | biostudies-literature
| PRJNA326795 | ENA
| S-EPMC3483541 | biostudies-literature
| S-EPMC4133618 | biostudies-literature
| S-EPMC8039954 | biostudies-literature
| S-EPMC6936061 | biostudies-literature
| S-EPMC5449640 | biostudies-literature
| S-EPMC4247168 | biostudies-literature
| S-EPMC7642382 | biostudies-literature
| S-EPMC7807764 | biostudies-literature