
Dataset Information


Identifying stage-specific protein subnetworks for colorectal cancer.


ABSTRACT:

Background

In recent years, many algorithms have been developed for network-based analysis of differential gene expression in complex diseases. These algorithms use protein-protein interaction (PPI) networks as an integrative framework and identify subnetworks that are coordinately dysregulated in the phenotype of interest.

Motivation

While such dysregulated subnetworks have demonstrated significant improvement over individual gene markers for classifying phenotypes, the current state of the art in dysregulated subnetwork discovery is almost exclusively limited to binary phenotype classes. However, many clinical applications require identification of molecular markers for multiple classes.

Approach

We consider the problem of discovering groups of genes whose expression signatures can discriminate multiple phenotype classes. We consider two alternative formulations of this problem: (i) an all-vs-all approach that aims to discover subnetworks distinguishing all classes, and (ii) a one-vs-all approach that aims to discover subnetworks distinguishing each class from the rest of the classes. For the one-vs-all formulation, we develop a set-cover based algorithm, which aims to identify groups of genes such that at least one gene in the group exhibits differential expression in the target class.
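The set-cover idea behind the one-vs-all formulation can be illustrated with a generic greedy heuristic. This is a minimal sketch under stated assumptions, not the authors' algorithm: it assumes each gene has already been mapped to the set of target-class samples in which it is differentially expressed, and it ignores the PPI network constraint entirely. The function name `greedy_gene_cover`, the gene names, and the sample labels are all hypothetical.

```python
from typing import Dict, List, Set


def greedy_gene_cover(covers: Dict[str, Set[str]], samples: Set[str]) -> List[str]:
    """Greedily pick genes until every coverable target-class sample is covered.

    covers: hypothetical mapping gene -> samples where that gene is
            differentially expressed (assumed to be computed beforehand).
    samples: the target-class samples that should each be covered by
             at least one selected gene.
    """
    uncovered = set(samples)
    chosen: List[str] = []
    while uncovered:
        # Gene covering the most still-uncovered samples; ties broken alphabetically.
        best = max(
            sorted(covers),
            key=lambda g: len(covers[g] & uncovered),
            default=None,
        )
        if best is None or not (covers[best] & uncovered):
            break  # no gene covers any of the remaining samples
        chosen.append(best)
        uncovered -= covers[best]
    return chosen


# Toy, made-up data for illustration only.
toy_covers = {
    "geneA": {"s1", "s2", "s3"},
    "geneB": {"s3", "s4"},
    "geneC": {"s4", "s5"},
    "geneD": {"s1"},
}
toy_samples = {"s1", "s2", "s3", "s4", "s5"}
selected = greedy_gene_cover(toy_covers, toy_samples)
print(selected)
```

On the toy data, the heuristic first takes the gene covering three samples and then the gene covering the two that remain. A network-guided variant would additionally restrict candidate genes to those connected to the current group in the PPI network; that restriction is not modeled here.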

Results

We test the proposed algorithms in the context of predicting stages of colorectal cancer. Our results show that the set-cover based algorithm identifying "stage-specific" subnetworks outperforms the all-vs-all approaches in classification. We also investigate the merits of utilizing PPI networks in the search for multiple markers, and show that, with correct parameter settings, network-guided search improves performance. Furthermore, we show that assessing statistical significance when selecting features greatly improves classification performance.

SUBMITTER: Erten S 

PROVIDER: S-EPMC3504924 | biostudies-literature

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC2667362 | biostudies-literature
| S-EPMC6458499 | biostudies-literature
| S-EPMC7566085 | biostudies-literature
| S-EPMC3553975 | biostudies-literature
| S-EPMC6129270 | biostudies-literature
| S-EPMC2914729 | biostudies-literature
2021-11-04 | GSE106584 | GEO
| S-EPMC4887059 | biostudies-literature
| S-EPMC7345993 | biostudies-literature
| S-EPMC8022234 | biostudies-literature