Dataset Information

DriveWays: a method for identifying possibly overlapping driver pathways in cancer.


ABSTRACT: Most previous methods for identifying cancer driver modules output nonoverlapping modules. The assumption that modules do not overlap is biologically inaccurate, as genes can participate in multiple molecular pathways. This is particularly true for cancer-associated genes, many of which are network hubs connecting functionally distinct sets of genes. It is important to provide combinatorial optimization problem definitions that model this biological phenomenon and to suggest efficient algorithms for solving them. We provide a formal definition of the Overlapping Driver Module Identification in Cancer (ODMIC) problem and show that the problem is NP-hard. We propose a seed-and-extend heuristic named DriveWays that identifies overlapping cancer driver modules in a graph built from the IntAct PPI network. DriveWays incorporates the mutual exclusivity, coverage, and network connectivity information of the genes. In evaluations on TCGA pan-cancer data, we show that DriveWays outperforms state-of-the-art methods in recovering well-known cancer driver genes. Additionally, the output modules of DriveWays show stronger enrichment for the reference pathways in almost all cases. Overall, we show that allowing modules to overlap improves the recovery of functional pathways filtered with known cancer drivers, which essentially constitute the reference set of cancer-related pathways.
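To make the seed-and-extend idea concrete, the following is a minimal Python sketch on toy data. It is not the authors' implementation: the scoring function (coverage multiplied by mutual exclusivity), the definitions of those two quantities, the greedy extension rule, the `max_size` cap, and all names and data below are assumptions chosen for illustration. The abstract does not specify how DriveWays combines these signals.

```python
from typing import Dict, Set

# Toy inputs (hypothetical): gene -> set of patient ids carrying a mutation,
# and an undirected PPI-like graph as adjacency sets.
MUTATIONS: Dict[str, Set[int]] = {"A": {0, 1}, "B": {2, 3}, "C": {4, 5}}
GRAPH: Dict[str, Set[str]] = {"A": {"B"}, "B": {"A", "C"}, "C": {"B"}}
N_PATIENTS = 6


def covered_patients(module: Set[str], mutations: Dict[str, Set[int]]) -> Set[int]:
    """Patients with a mutation in at least one gene of the module."""
    return set().union(*(mutations.get(g, set()) for g in module))


def coverage(module: Set[str], mutations: Dict[str, Set[int]], n_patients: int) -> float:
    """Assumed definition: fraction of patients covered by the module."""
    return len(covered_patients(module, mutations)) / n_patients


def mutual_exclusivity(module: Set[str], mutations: Dict[str, Set[int]]) -> float:
    """Assumed definition: covered patients / sum of per-gene mutation counts.

    Equals 1.0 when no patient is mutated in more than one gene of the module.
    """
    total = sum(len(mutations.get(g, set())) for g in module)
    if total == 0:
        return 0.0
    return len(covered_patients(module, mutations)) / total


def score(module: Set[str], mutations: Dict[str, Set[int]], n_patients: int) -> float:
    """Illustrative module score (an assumption, not the paper's objective)."""
    return coverage(module, mutations, n_patients) * mutual_exclusivity(module, mutations)


def seed_and_extend(seed: str, graph: Dict[str, Set[str]],
                    mutations: Dict[str, Set[int]], n_patients: int,
                    max_size: int = 4) -> Set[str]:
    """Greedily grow a connected module from a seed gene.

    At each step, add the neighboring gene that most improves the score;
    stop when no neighbor improves it or the size cap is reached.
    """
    module = {seed}
    while len(module) < max_size:
        frontier = set().union(*(graph.get(g, set()) for g in module)) - module
        best, best_score = None, score(module, mutations, n_patients)
        for cand in sorted(frontier):  # sorted for deterministic tie-breaking
            s = score(module | {cand}, mutations, n_patients)
            if s > best_score:
                best, best_score = cand, s
        if best is None:
            break
        module.add(best)
    return module


# Modules are grown independently per seed, so they may share genes.
module_a = seed_and_extend("A", GRAPH, MUTATIONS, N_PATIENTS, max_size=2)
module_c = seed_and_extend("C", GRAPH, MUTATIONS, N_PATIENTS, max_size=2)
print(sorted(module_a), sorted(module_c), sorted(module_a & module_c))
```

In this toy run the modules grown from seeds `A` and `C` both absorb the hub gene `B`, which is the kind of overlap a nonoverlapping partition of the genes would rule out.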

SUBMITTER: Baali I 

PROVIDER: S-EPMC7738685 | biostudies-literature | 2020 Dec

REPOSITORIES: biostudies-literature

Publications

DriveWays: a method for identifying possibly overlapping driver pathways in cancer.

Baali I, Erten C, Kazan H

Scientific Reports, 2020 Dec 15, issue 1


Similar Datasets

| S-EPMC6100049 | biostudies-literature
| S-EPMC6145398 | biostudies-literature
| S-EPMC3769934 | biostudies-literature
| S-EPMC6099653 | biostudies-literature
| S-EPMC4914110 | biostudies-literature
| S-EPMC3018819 | biostudies-other
| S-EPMC5308635 | biostudies-literature
| S-EPMC4111852 | biostudies-other
| S-EPMC4426832 | biostudies-other
| S-EPMC3866686 | biostudies-literature