Unknown

Dataset Information

0

Sparse Biclustering of Transposable Data.


ABSTRACT: We consider the task of simultaneously clustering the rows and columns of a large transposable data matrix. We assume that the matrix elements are normally distributed with a bicluster-specific mean term and a common variance, and perform biclustering by maximizing the corresponding log likelihood. We apply an ?1 penalty to the means of the biclusters in order to obtain sparse and interpretable biclusters. Our proposal amounts to a sparse, symmetrized version of k-means clustering. We show that k-means clustering of the rows and of the columns of a data matrix can be seen as special cases of our proposal, and that a relaxation of our proposal yields the singular value decomposition. In addition, we propose a framework for bi-clustering based on the matrix-variate normal distribution. The performances of our proposals are demonstrated in a simulation study and on a gene expression data set. This article has supplementary material online.

SUBMITTER: Tan KM 

PROVIDER: S-EPMC4212513 | biostudies-literature | 2014

REPOSITORIES: biostudies-literature

altmetric image

Publications

Sparse Biclustering of Transposable Data.

Tan Kean Ming KM   Witten Daniela M DM  

Journal of computational and graphical statistics : a joint publication of American Statistical Association, Institute of Mathematical Statistics, Interface Foundation of North America 20140101 4


We consider the task of simultaneously clustering the rows and columns of a large transposable data matrix. We assume that the matrix elements are normally distributed with a bicluster-specific mean term and a common variance, and perform biclustering by maximizing the corresponding log likelihood. We apply an ℓ<sub>1</sub> penalty to the means of the biclusters in order to obtain sparse and interpretable biclusters. Our proposal amounts to a sparse, symmetrized version of <i>k</i>-means cluster  ...[more]

Similar Datasets

| S-EPMC7028479 | biostudies-literature
| S-EPMC8574648 | biostudies-literature
| S-EPMC3411756 | biostudies-literature
| S-EPMC4978779 | biostudies-literature
| S-EPMC10153449 | biostudies-literature
| S-EPMC3375643 | biostudies-literature
| S-EPMC10701104 | biostudies-literature
| S-EPMC6598466 | biostudies-literature
| S-EPMC6931057 | biostudies-literature
| S-EPMC7272289 | biostudies-literature