Unknown

Dataset Information

0

Bayesian structural equation modeling in multiple omics data with application to circadian genes.


ABSTRACT:

Motivation

It is well known that the integration among different data-sources is reliable because of its potential of unveiling new functionalities of the genomic expressions, which might be dormant in a single-source analysis. Moreover, different studies have justified the more powerful analyses of multi-platform data. Toward this, in this study, we consider the circadian genes' omics profile, such as copy number changes and RNA-sequence data along with their survival response. We develop a Bayesian structural equation modeling coupled with linear regressions and log normal accelerated failure-time regression to integrate the information between these two platforms to predict the survival of the subjects. We place conjugate priors on the regression parameters and derive the Gibbs sampler using the conditional distributions of them.

Results

Our extensive simulation study shows that the integrative model provides a better fit to the data than its closest competitor. The analyses of glioblastoma cancer data and the breast cancer data from TCGA, the largest genomics and transcriptomics database, support our findings.

Availability and implementation

The developed method is wrapped in R package available at https://github.com/MAITYA02/semmcmc.

Supplementary information

Supplementary data are available at Bioinformatics online.

SUBMITTER: Maity AK 

PROVIDER: S-EPMC7332567 | biostudies-literature |

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC9075702 | biostudies-literature
| S-EPMC5590745 | biostudies-literature
| S-EPMC7565120 | biostudies-literature
| S-EPMC7187322 | biostudies-literature
| S-EPMC7187790 | biostudies-literature
| S-EPMC9838875 | biostudies-literature
| S-EPMC8098022 | biostudies-literature
| S-EPMC8162623 | biostudies-literature
| S-EPMC3371320 | biostudies-literature
| S-EPMC4689874 | biostudies-literature