Dataset Information

Sparse multiple co-Inertia analysis with application to integrative analysis of multi-Omics data.


ABSTRACT: BACKGROUND: Multiple co-inertia analysis (mCIA) is a multivariate analysis method that can assess relationships and trends in multiple datasets. It has recently been used for integrative analysis of multiple high-dimensional -omics datasets. However, its estimated loading vectors are non-sparse, which makes it difficult to identify important features and interpret analysis results. We propose two new mCIA methods: 1) a sparse mCIA method that produces sparse loading estimates and 2) a structured sparse mCIA method that additionally incorporates structural information among variables, such as that from functional genomics. RESULTS: Our extensive simulation studies demonstrate the superior performance of the sparse mCIA and structured sparse mCIA methods compared to the existing mCIA in terms of feature selection and estimation accuracy. Application to the integrative analysis of transcriptomics and proteomics data from a cancer study identified biomarkers that the literature suggests are related to cancer. CONCLUSION: The proposed sparse mCIA achieves simultaneous model estimation and feature selection and yields analysis results that are more interpretable than those of the existing mCIA. Furthermore, the proposed structured sparse mCIA can effectively incorporate prior network information among genes, resulting in improved feature selection and enhanced interpretability.

SUBMITTER: Min EJ 

PROVIDER: S-EPMC7157996 | biostudies-literature | 2020 Apr

REPOSITORIES: biostudies-literature

Publications

Sparse multiple co-Inertia analysis with application to integrative analysis of multi-Omics data.

Min Eun Jeong, Long Qi

BMC Bioinformatics, 2020-04-15, issue 1


Similar Datasets

| S-EPMC7663540 | biostudies-literature
| S-EPMC6419918 | biostudies-literature
| S-EPMC4209817 | biostudies-literature
| S-EPMC7979890 | biostudies-literature
| S-EPMC3901289 | biostudies-other
| S-EPMC4133046 | biostudies-literature
| S-EPMC8341864 | biostudies-literature
| S-EPMC4945831 | biostudies-literature
| S-EPMC6692785 | biostudies-literature
| S-EPMC6223369 | biostudies-literature