Dataset Information

Multi-view Subspace Clustering Analysis for Aggregating Multiple Heterogeneous Omics Data.


ABSTRACT: Integrating distinct biological data types can provide a comprehensive view of biological processes or complex diseases. The combinations of molecules responsible for different phenotypes form multiple embedded (expression) subspaces, so identifying the intrinsic data structure is challenging for conventional integration methods. In this paper, we propose a novel framework, "Multi-view Subspace Clustering Analysis (MSCA)," which measures the local similarities of samples in the same subspace and obtains the global consensus sample patterns (structures) across multiple data types, thereby comprehensively capturing the underlying heterogeneity of samples. Applied to various synthetic datasets, MSCA effectively recognizes the predefined sample patterns and is robust to data noise. On a real biological dataset, the Cancer Cell Line Encyclopedia (CCLE), MSCA successfully identifies cell clusters with common aberrations across cancer types. Our simulation and case studies also show that MSCA markedly outperforms state-of-the-art methods such as iClusterPlus, SNF, and ANF.
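
The general idea of combining per-view sample similarities into one consensus pattern can be illustrated with a minimal sketch. This is not the MSCA algorithm, whose details are not given in this record: it swaps in a plain Gaussian affinity per view, a simple average as the consensus, and thresholded connected components as the clustering step. All function names, the `sigma` and `threshold` parameters, and the toy data are assumptions made for illustration.

```python
import math

def gaussian_affinity(view, sigma=1.0):
    """Pairwise Gaussian similarity between samples (rows) of one data view."""
    n = len(view)
    aff = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            d2 = sum((a - b) ** 2 for a, b in zip(view[i], view[j]))
            aff[i][j] = math.exp(-d2 / (2 * sigma ** 2))
    return aff

def consensus_affinity(views, sigma=1.0):
    """Average the per-view affinities into one sample-by-sample matrix.
    Assumption: every view lists the same samples in the same order."""
    mats = [gaussian_affinity(v, sigma) for v in views]
    n = len(mats[0])
    return [[sum(m[i][j] for m in mats) / len(mats) for j in range(n)]
            for i in range(n)]

def threshold_clusters(aff, threshold=0.5):
    """Label samples by connected components of the thresholded graph."""
    n = len(aff)
    labels = [-1] * n
    current = 0
    for start in range(n):
        if labels[start] != -1:
            continue
        labels[start] = current
        stack = [start]
        while stack:
            i = stack.pop()
            for j in range(n):
                if labels[j] == -1 and aff[i][j] >= threshold:
                    labels[j] = current
                    stack.append(j)
        current += 1
    return labels

# Hypothetical toy data: two views (e.g. two omics layers) for four samples.
view_a = [[0.0, 0.1], [0.1, 0.0], [5.0, 5.1], [5.1, 5.0]]
view_b = [[1.0], [1.1], [9.0], [9.2]]

consensus = consensus_affinity([view_a, view_b])
labels = threshold_clusters(consensus)
print(labels)
```

In this toy case the first two samples and the last two samples end up in separate groups, because they are close to each other in both views. A real multi-omics analysis would need per-view normalization and a more robust clustering step than a fixed threshold.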

SUBMITTER: Shi Q 

PROVIDER: S-EPMC6712585 | biostudies-literature | 2019

REPOSITORIES: biostudies-literature

Publications

Multi-view Subspace Clustering Analysis for Aggregating Multiple Heterogeneous Omics Data.

Shi Qianqian, Hu Bing, Zeng Tao, Zhang Chuanchao

Frontiers in Genetics, 2019-08-20


Similar Datasets

| S-EPMC7423957 | biostudies-literature
| S-EPMC5441581 | biostudies-literature
| S-EPMC6102576 | biostudies-literature
| S-EPMC6455926 | biostudies-literature
| S-EPMC7157996 | biostudies-literature
| S-EPMC7986585 | biostudies-literature
| S-EPMC8696097 | biostudies-literature
| S-EPMC7161108 | biostudies-literature
| S-EPMC5773919 | biostudies-literature
| S-EPMC6010767 | biostudies-literature