Dataset Information

Multi-view clustering for multi-omics data using unified embedding.

ABSTRACT: In real-world applications, data sets often comprise multiple views, which provide consensus and complementary information to each other. Embedding learning is an effective strategy for nearest neighbour search and dimensionality reduction in large data sets. This paper attempts to learn a unified probability distribution of the points across different views and to generate a unified embedding in a low-dimensional space that optimally preserves neighbourhood identity. The probability distributions generated for each point in each view are combined by the conflation method to create a single unified distribution. The goal is to approximate this unified distribution as closely as possible when a similar operation is performed on the embedded space. The cost function is the sum of Kullback-Leibler divergences over the samples, which leads to a simple gradient that adjusts the positions of the samples in the embedded space. The proposed methodology can generate embeddings from both complete and incomplete multi-view data sets. Finally, a multi-objective clustering technique (AMOSA) is applied to group the samples in the embedded space. The proposed methodology, Multi-view Neighbourhood Embedding (MvNE), shows an improvement of approximately 2-3% over state-of-the-art models when evaluated on 10 omics data sets.
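
The two building blocks named in the abstract, conflation of per-view distributions and a Kullback-Leibler cost, can be illustrated with a minimal sketch. It assumes that the neighbour probabilities of a single point are already available as plain lists over the same set of neighbours, and it uses the standard discrete definition of conflation (normalized elementwise product). The function names `conflate` and `kl_divergence` and the toy numbers are hypothetical; how MvNE computes the per-view probabilities, handles incomplete views, or optimizes the embedding is not described in this record and is not reproduced here.

```python
import math


def conflate(distributions):
    """Conflate discrete distributions defined over the same support.

    Standard discrete conflation: multiply the probabilities elementwise,
    then renormalize so the result sums to 1.
    """
    size = len(distributions[0])
    product = [1.0] * size
    for dist in distributions:
        if len(dist) != size:
            raise ValueError("all distributions must share the same support")
        for j, p in enumerate(dist):
            product[j] *= p
    total = sum(product)
    if total == 0.0:
        raise ValueError("conflation is undefined when the product is zero everywhere")
    return [p / total for p in product]


def kl_divergence(p, q, eps=1e-12):
    """KL(p || q) for discrete distributions; terms with p_i == 0 contribute 0."""
    return sum(pi * math.log(pi / max(qi, eps)) for pi, qi in zip(p, q) if pi > 0.0)


# Toy example (made-up numbers): neighbour probabilities of one point in two views.
view_a = [0.5, 0.3, 0.2]
view_b = [0.6, 0.2, 0.2]
unified = conflate([view_a, view_b])

# Hypothetical neighbour probabilities of the same point in the embedded space.
embedded = [0.7, 0.2, 0.1]
cost = kl_divergence(unified, embedded)
```

In this toy case the conflated distribution concentrates on the neighbour that both views agree on, and the cost for the point is zero only when the embedded-space distribution matches the unified one exactly. Summing such costs over all samples would give the kind of objective the abstract describes.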

SUBMITTER: Mitra S

PROVIDER: S-EPMC7423957 | biostudies-literature | 2020 Aug

REPOSITORIES: biostudies-literature

Publications

Multi-view clustering for multi-omics data using unified embedding.

Sayantan Mitra, Sriparna Saha, Mohammed Hasanuzzaman

Scientific Reports, 2020-08-12, issue 1

PMID: 32788601

Similar Datasets

Multi-view Subspace Clustering Analysis for Aggregating Multiple Heterogeneous Omics Data. | S-EPMC6712585 | biostudies-literature

Clustering single-cell multi-omics data via graph regularized multi-view ensemble learning. | S-EPMC11015955 | biostudies-literature

A Unified Bayesian Framework for Bi-overlapping-Clustering Multi-omics Data via Sparse Matrix Factorization. | S-EPMC10766378 | biostudies-literature

MOVIS: A multi-omics software solution for multi-modal time-series clustering, embedding, and visualizing tasks. | S-EPMC8886009 | biostudies-literature

OmiEmbed: A Unified Multi-Task Deep Learning Framework for Multi-Omics Data. | S-EPMC8235477 | biostudies-literature

Clustering single-cell multi-omics data with MoClust. | S-EPMC9805570 | biostudies-literature

Omics community detection using multi-resolution clustering. | S-EPMC8545346 | biostudies-literature

Clustering multilayer omics data using MuNCut. | S-EPMC5991460 | biostudies-literature

A unified model for interpretable latent embedding of multi-sample, multi-condition single-cell data. | S-EPMC11298001 | biostudies-literature

Deep clustering representation of spatially resolved transcriptomics data using multi-view variational graph auto-encoders with consensus clustering. | S-EPMC11664090 | biostudies-literature