Ontology highlight
ABSTRACT: Motivation
Recent advances in high-throughput omics technologies have enabled biomedical researchers to collect large-scale genomic data. As a consequence, there has been growing interest in developing methods to integrate such data to obtain deeper insights regarding the underlying biological system. A key challenge for integrative studies is the heterogeneity present in the different omics data sources, which makes it difficult to discern the coordinated signal of interest from source-specific noise or extraneous effects.Results
We introduce a novel method of multi-modal data analysis that is designed for heterogeneous data based on non-negative matrix factorization. We provide an algorithm for jointly decomposing the data matrices involved that also includes a sparsity option for high-dimensional settings. The performance of the proposed method is evaluated on synthetic data and on real DNA methylation, gene expression and miRNA expression data from ovarian cancer samples obtained from The Cancer Genome Atlas. The results show the presence of common modules across patient samples linked to cancer-related pathways, as well as previously established ovarian cancer subtypes.Availability and implementation
The source code repository is publicly available at https://github.com/yangzi4/iNMF.Contact
gmichail@umich.eduSupplementary information
Supplementary data are available at Bioinformatics online.
SUBMITTER: Yang Z
PROVIDER: S-EPMC5006236 | biostudies-literature | 2016 Jan
REPOSITORIES: biostudies-literature
Yang Zi Z Michailidis George G
Bioinformatics (Oxford, England) 20150915 1
<h4>Motivation</h4>Recent advances in high-throughput omics technologies have enabled biomedical researchers to collect large-scale genomic data. As a consequence, there has been growing interest in developing methods to integrate such data to obtain deeper insights regarding the underlying biological system. A key challenge for integrative studies is the heterogeneity present in the different omics data sources, which makes it difficult to discern the coordinated signal of interest from source- ...[more]