Dataset Information

Bayesian data integration and variable selection for pan-cancer survival prediction using protein expression data.

ABSTRACT: Accurate prognostic prediction using molecular information is a challenging area of research, which is essential to develop precision medicine. In this paper, we develop translational models to identify major actionable proteins that are associated with clinical outcomes, like the survival time of patients. There are considerable statistical and computational challenges due to the large dimension of the problems. Furthermore, data are available for different tumor types; hence data integration for various tumors is desirable. Having censored survival outcomes escalates one more level of complexity in the inferential procedure. We develop Bayesian hierarchical survival models, which accommodate all the challenges mentioned here. We use the hierarchical Bayesian accelerated failure time model for survival regression. Furthermore, we assume sparse horseshoe prior distribution for the regression coefficients to identify the major proteomic drivers. We borrow strength across tumor groups by introducing a correlation structure among the prior distributions. The proposed methods have been used to analyze data from the recently curated "The Cancer Proteome Atlas" (TCPA), which contains reverse-phase protein arrays-based high-quality protein expression data as well as detailed clinical annotation, including survival times. Our simulation and the TCPA data analysis illustrate the efficacy of the proposed integrative model, which links different tumors with the correlated prior structures.

SUBMITTER: Maity AK

PROVIDER: S-EPMC7007312 | biostudies-literature | 2020 Mar

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

Bayesian data integration and variable selection for pan-cancer survival prediction using protein expression data.

Maity Arnab Kumar AK Bhattacharya Anirban A Mallick Bani K BK Baladandayuthapani Veerabhadran V

Biometrics 20191003 1

Accurate prognostic prediction using molecular information is a challenging area of research, which is essential to develop precision medicine. In this paper, we develop translational models to identify major actionable proteins that are associated with clinical outcomes, like the survival time of patients. There are considerable statistical and computational challenges due to the large dimension of the problems. Furthermore, data are available for different tumor types; hence data integration f ...[more]

PMID: 31393003

Dataset Information

Bayesian data integration and variable selection for pan-cancer survival prediction using protein expression data.

Publications

Bayesian data integration and variable selection for pan-cancer survival prediction using protein expression data.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

Integration of Survival and Binary Data for Variable Selection and Prediction: A Bayesian Approach.
| S-EPMC7729996 | biostudies-literature

Integration of Multiple Genomic Data Sources in a Bayesian Cox Model for Variable Selection and Prediction.
| S-EPMC5554576 | biostudies-other

Bayesian variable selection for parametric survival model with applications to cancer omics data.
| S-EPMC6218990 | biostudies-literature

Bayesian ensemble methods for survival prediction in gene expression data.
| S-EPMC3031034 | biostudies-literature

Bayesian bi-level variable selection for genome-wide survival study.
| S-EPMC10584651 | biostudies-literature

Scalable Bayesian variable selection for structured high-dimensional data.
| S-EPMC6222001 | biostudies-literature

Bayesian subset selection and variable importance for interpretable prediction and classification.
| S-EPMC10723825 | biostudies-literature

Bayesian Multiresolution Variable Selection for Ultra-High Dimensional Neuroimaging Data.
| S-EPMC5885321 | biostudies-literature

Fine mapping and accurate prediction of complex traits using Bayesian Variable Selection models applied to biobank-size data.
| S-EPMC9995454 | biostudies-literature

Tractable Bayesian variable selection: beyond normality.
| S-EPMC6426142 | biostudies-literature