Dataset Information

Pathway aggregation for survival prediction via multiple kernel learning.


ABSTRACT: Attempts to predict prognosis in cancer patients using high-dimensional genomic data such as gene expression in tumor tissue can be made difficult by the large number of features and the potential complexity of the relationship between features and the outcome. Integrating prior biological knowledge into risk prediction with such data by grouping genomic features into pathways and networks reduces the dimensionality of the problem and could improve prediction accuracy. Additionally, such knowledge-based models may be more biologically grounded and interpretable. Prediction could potentially be further improved by allowing for complex nonlinear pathway effects. The kernel machine framework has been proposed as an effective approach for modeling the nonlinear and interactive effects of genes in pathways for both censored and noncensored outcomes. When multiple pathways are under consideration, one may efficiently select informative pathways and aggregate their signals via multiple kernel learning (MKL), which has been proposed for prediction of noncensored outcomes. In this paper, we propose MKL methods for censored survival outcomes. We derive our approach for a general survival modeling framework with a convex objective function and illustrate its application under the Cox proportional hazards and semiparametric accelerated failure time models. Numerical studies demonstrate that the proposed MKL-based prediction methods work well in finite sample and can potentially outperform models constructed assuming linear effects or ignoring the group knowledge. The methods are illustrated with an application to 2 cancer data sets.
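The abstract describes aggregating pathway-level kernels and fitting them under a Cox proportional hazards model. The following is a minimal sketch of that general idea, not the authors' algorithm: it builds one Gaussian kernel per pathway, combines them with nonnegative weights, and evaluates a penalized Cox negative log partial likelihood for a kernel expansion of the risk score. The function names, the Gaussian kernel choice, the fixed bandwidth, the ridge-type penalty, and the no-tied-event-times simplification are all assumptions made for illustration; the record itself does not specify them.

```python
import numpy as np


def gaussian_kernel(X, bandwidth):
    """Gaussian (RBF) kernel matrix between the rows of X (assumed kernel choice)."""
    sq_norms = np.sum(X ** 2, axis=1)
    sq_dists = sq_norms[:, None] + sq_norms[None, :] - 2.0 * (X @ X.T)
    sq_dists = np.maximum(sq_dists, 0.0)  # guard against tiny negative round-off
    return np.exp(-sq_dists / (2.0 * bandwidth ** 2))


def combined_kernel(X, pathways, weights, bandwidth=1.0):
    """Weighted sum of per-pathway kernels.

    pathways: list of column-index lists, one per pathway (hypothetical grouping).
    weights:  nonnegative pathway weights; a zero weight drops that pathway.
    """
    n = X.shape[0]
    K = np.zeros((n, n))
    for cols, w in zip(pathways, weights):
        K += w * gaussian_kernel(X[:, cols], bandwidth)
    return K


def cox_neg_log_partial_likelihood(risk, time, event):
    """Cox negative log partial likelihood, assuming no tied event times."""
    order = np.argsort(-time)  # latest time first, so running sums form risk sets
    risk, event = risk[order], event[order]
    log_risk_set = np.logaddexp.accumulate(risk)  # log of summed exp(risk) over each risk set
    return -np.sum((risk - log_risk_set)[event == 1])


def penalized_objective(alpha, K, time, event, lam):
    """Objective for the risk score f = K @ alpha with an assumed ridge-type penalty."""
    risk = K @ alpha
    return cox_neg_log_partial_likelihood(risk, time, event) + lam * alpha @ K @ alpha


# Illustrative use on simulated data (all values here are made up).
rng = np.random.default_rng(0)
X = rng.normal(size=(20, 6))
time = rng.exponential(size=20)
event = rng.integers(0, 2, size=20)
K = combined_kernel(X, pathways=[[0, 1, 2], [3, 4, 5]], weights=[0.6, 0.4])
value = penalized_objective(np.zeros(20), K, time, event, lam=0.1)
```

In a full MKL procedure the pathway weights would be estimated together with `alpha`, typically under a sparsity-inducing constraint so that uninformative pathways receive zero weight; here they are fixed by hand to keep the sketch short.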

SUBMITTER: Sinnott JA 

PROVIDER: S-EPMC5994931 | biostudies-literature | 2018 Jul

REPOSITORIES: biostudies-literature

Publications

Pathway aggregation for survival prediction via multiple kernel learning.

Jennifer A. Sinnott, Tianxi Cai

Statistics in Medicine, 2018 Apr 17; issue 16


Similar Datasets

S-EPMC8561914 | biostudies-literature
S-EPMC6401099 | biostudies-literature
S-EPMC6694479 | biostudies-literature
S-EPMC5727873 | biostudies-literature
S-EPMC9689861 | biostudies-literature
S-EPMC3427351 | biostudies-literature
S-EPMC7181520 | biostudies-literature
S-EPMC7299324 | biostudies-literature
S-EPMC5942586 | biostudies-literature
S-EPMC6410562 | biostudies-literature