Unknown

Dataset Information

0

An Integrative Pathway-based Clinical-genomic Model for Cancer Survival Prediction.


ABSTRACT: Prediction models that use gene expression levels are now being proposed for personalized treatment of cancer, but building accurate models that are easy to interpret remains a challenge. In this paper, we describe an integrative clinical-genomic approach that combines both genomic pathway and clinical information. First, we summarize information from genes in each pathway using Supervised Principal Components (SPCA) to obtain pathway-based genomic predictors. Next, we build a prediction model based on clinical variables and pathway-based genomic predictors using Random Survival Forests (RSF). Our rationale for this two-stage procedure is that the underlying disease process may be influenced by environmental exposure (measured by clinical variables) and perturbations in different pathways (measured by pathway-based genomic variables), as well as their interactions. Using two cancer microarray datasets, we show that the pathway-based clinical-genomic model outperforms gene-based clinical-genomic models, with improved prediction accuracy and interpretability.

SUBMITTER: Chen X 

PROVIDER: S-EPMC3124349 | biostudies-literature | 2010 Sep

REPOSITORIES: biostudies-literature

altmetric image

Publications

An Integrative Pathway-based Clinical-genomic Model for Cancer Survival Prediction.

Chen Xi X   Wang Lily L   Ishwaran Hemant H  

Statistics & probability letters 20100901 17-18


Prediction models that use gene expression levels are now being proposed for personalized treatment of cancer, but building accurate models that are easy to interpret remains a challenge. In this paper, we describe an integrative clinical-genomic approach that combines both genomic pathway and clinical information. First, we summarize information from genes in each pathway using Supervised Principal Components (SPCA) to obtain pathway-based genomic predictors. Next, we build a prediction model b  ...[more]

Similar Datasets

2011-10-07 | E-GEOD-32688 | biostudies-arrayexpress
| S-EPMC4167688 | biostudies-literature
| S-ECPF-GEOD-32676 | biostudies-other
| S-ECPF-GEOD-32678 | biostudies-other
| S-EPMC4168973 | biostudies-literature
| S-EPMC4861129 | biostudies-literature
| S-EPMC8672105 | biostudies-literature
2011-10-08 | GSE32688 | GEO
2011-10-07 | E-GEOD-32676 | biostudies-arrayexpress
2011-10-07 | E-GEOD-32682 | biostudies-arrayexpress