Unknown

Dataset Information

0

Meta-analytic principal component analysis in integrative omics application.


ABSTRACT: Motivation:With the prevalent usage of microarray and massively parallel sequencing, numerous high-throughput omics datasets have become available in the public domain. Integrating abundant information among omics datasets is critical to elucidate biological mechanisms. Due to the high-dimensional nature of the data, methods such as principal component analysis (PCA) have been widely applied, aiming at effective dimension reduction and exploratory visualization. Results:In this article, we combine multiple omics datasets of identical or similar biological hypothesis and introduce two variations of meta-analytic framework of PCA, namely MetaPCA. Regularization is further incorporated to facilitate sparse feature selection in MetaPCA. We apply MetaPCA and sparse MetaPCA to simulations, three transcriptomic meta-analysis studies in yeast cell cycle, prostate cancer, mouse metabolism and a TCGA pan-cancer methylation study. The result shows improved accuracy, robustness and exploratory visualization of the proposed framework. Availability and implementation:An R package MetaPCA is available online. (http://tsenglab.biostat.pitt.edu/software.htm). Contact:ctseng@pitt.edu. Supplementary information:Supplementary data are available at Bioinformatics online.

SUBMITTER: Kim S 

PROVIDER: S-EPMC5905607 | biostudies-literature | 2018 Apr

REPOSITORIES: biostudies-literature

altmetric image

Publications

Meta-analytic principal component analysis in integrative omics application.

Kim SungHwan S   Kang Dongwan D   Huo Zhiguang Z   Park Yongseok Y   Tseng George C GC  

Bioinformatics (Oxford, England) 20180401 8


<h4>Motivation</h4>With the prevalent usage of microarray and massively parallel sequencing, numerous high-throughput omics datasets have become available in the public domain. Integrating abundant information among omics datasets is critical to elucidate biological mechanisms. Due to the high-dimensional nature of the data, methods such as principal component analysis (PCA) have been widely applied, aiming at effective dimension reduction and exploratory visualization.<h4>Results</h4>In this ar  ...[more]

Similar Datasets

| S-EPMC5912177 | biostudies-literature
| S-EPMC7663540 | biostudies-literature
| S-EPMC8638895 | biostudies-literature
2011-08-15 | GSE31375 | GEO
| S-EPMC7868054 | biostudies-literature
| S-EPMC6191041 | biostudies-literature
| S-EPMC4383722 | biostudies-literature
| S-EPMC4721272 | biostudies-literature
| S-EPMC3131008 | biostudies-literature
| S-EPMC2835171 | biostudies-literature