Unknown

Dataset Information

0

Homogeneity Pursuit.


ABSTRACT: This paper explores the homogeneity of coefficients in high-dimensional regression, which extends the sparsity concept and is more general and suitable for many applications. Homogeneity arises when regression coefficients corresponding to neighboring geographical regions or a similar cluster of covariates are expected to be approximately the same. Sparsity corresponds to a special case of homogeneity with a large cluster of known atom zero. In this article, we propose a new method called clustering algorithm in regression via data-driven segmentation (CARDS) to explore homogeneity. New mathematics are provided on the gain that can be achieved by exploring homogeneity. Statistical properties of two versions of CARDS are analyzed. In particular, the asymptotic normality of our proposed CARDS estimator is established, which reveals better estimation accuracy for homogeneous parameters than that without homogeneity exploration. When our methods are combined with sparsity exploration, further efficiency can be achieved beyond the exploration of sparsity alone. This provides additional insights into the power of exploring low-dimensional structures in high-dimensional regression: homogeneity and sparsity. Our results also shed lights on the properties of the fussed Lasso. The newly developed method is further illustrated by simulation studies and applications to real data. Supplementary materials for this article are available online.

SUBMITTER: Ke T 

PROVIDER: S-EPMC4465377 | biostudies-literature | 2015

REPOSITORIES: biostudies-literature

altmetric image

Publications

Homogeneity Pursuit.

Ke Tracy T   Fan Jianqing J   Wu Yichao Y  

Journal of the American Statistical Association 20150101 509


This paper explores the homogeneity of coefficients in high-dimensional regression, which extends the sparsity concept and is more general and suitable for many applications. Homogeneity arises when regression coefficients corresponding to neighboring geographical regions or a similar cluster of covariates are expected to be approximately the same. Sparsity corresponds to a special case of homogeneity with a large cluster of known atom zero. In this article, we propose a new method called cluste  ...[more]

Similar Datasets

| S-EPMC7182294 | biostudies-literature
| S-EPMC6140545 | biostudies-literature
| S-EPMC5366095 | biostudies-literature
| S-EPMC7007341 | biostudies-literature
| S-EPMC8266361 | biostudies-literature
| S-EPMC5444509 | biostudies-literature
| S-EPMC6095539 | biostudies-literature
| S-EPMC4669211 | biostudies-literature
| S-EPMC10088529 | biostudies-literature
| S-EPMC3223069 | biostudies-literature