Dataset Information

The cluster graphical lasso for improved estimation of Gaussian graphical models.


ABSTRACT: The task of estimating a Gaussian graphical model in the high-dimensional setting is considered. The graphical lasso, which involves maximizing the Gaussian log likelihood subject to a lasso penalty, is a well-studied approach for this task. A surprising connection between the graphical lasso and hierarchical clustering is introduced: the graphical lasso in effect performs a two-step procedure, in which (1) single linkage hierarchical clustering is performed on the variables in order to identify connected components, and then (2) a penalized log likelihood is maximized on the subset of variables within each connected component. Thus, the graphical lasso determines the connected components of the estimated network via single linkage clustering. Single linkage clustering is known to perform poorly in certain finite-sample settings. Therefore, the cluster graphical lasso is proposed: it clusters the features using an alternative to single linkage clustering, and then performs the graphical lasso on the subset of variables within each cluster. Model selection consistency for this technique is established, and its improved performance relative to the graphical lasso is demonstrated in a simulation study, as well as in applications to a university webpage data set and a gene expression data set.
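Step (1) of the two-step view can be illustrated with a minimal sketch. It assumes, as the abstract does not spell out, that the similarity between two variables is the absolute value of their empirical covariance and that the single linkage dendrogram is cut at the lasso tuning parameter; under that assumption the clusters are the connected components of the graph that links variables i and j whenever |S[i][j]| exceeds the threshold. The function name `threshold_components`, the toy matrix `S`, and the thresholds are hypothetical and chosen only for illustration.

```python
def threshold_components(S, lam):
    """Connected components of the graph with an edge (i, j) whenever |S[i][j]| > lam.

    Assumption: S is a symmetric similarity matrix (e.g. an empirical covariance),
    given as a list of lists. Cutting a single linkage dendrogram built on |S| at
    height lam yields the same groups as these connected components.
    """
    p = len(S)
    parent = list(range(p))  # union-find forest, one node per variable

    def find(i):
        # Follow parent pointers to the root, compressing the path as we go.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for i in range(p):
        for j in range(i + 1, p):
            if abs(S[i][j]) > lam:
                ri, rj = find(i), find(j)
                if ri != rj:
                    parent[rj] = ri  # merge the two components

    groups = {}
    for i in range(p):
        groups.setdefault(find(i), []).append(i)
    return sorted(groups.values())


# Toy 4-variable similarity matrix (hypothetical values).
S = [
    [1.00, 0.60, 0.10, 0.00],
    [0.60, 1.00, 0.30, 0.05],
    [0.10, 0.30, 1.00, 0.50],
    [0.00, 0.05, 0.50, 1.00],
]

blocks_high = threshold_components(S, 0.4)  # two blocks: {0, 1} and {2, 3}
blocks_low = threshold_components(S, 0.2)   # one block: the 0.30 entry chains them
```

In this toy example a single moderate entry (0.30 between variables 1 and 2) is enough to merge two otherwise separate blocks at the lower threshold, which is the chaining behaviour that makes single linkage fragile. The cluster graphical lasso described above would replace this step with a different clustering method and then fit a graphical lasso separately within each resulting block.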

SUBMITTER: Tan KM 

PROVIDER: S-EPMC4307846 | biostudies-literature | 2015 May

REPOSITORIES: biostudies-literature

Publications

The cluster graphical lasso for improved estimation of Gaussian graphical models.

Kean Ming Tan, Daniela Witten, Ali Shojaie

Computational Statistics & Data Analysis, 2015-05-01


Similar Datasets

| S-EPMC5515703 | biostudies-literature
| S-EPMC2808166 | biostudies-literature
| S-EPMC5640885 | biostudies-literature
| S-EPMC6916355 | biostudies-literature
| S-EPMC6456846 | biostudies-literature
| S-EPMC4012833 | biostudies-literature
| S-EPMC6901079 | biostudies-literature
| S-EPMC7540244 | biostudies-literature
| S-EPMC8424921 | biostudies-literature
| S-EPMC4974017 | biostudies-literature