Dataset Information

Regularity Properties for Sparse Regression.

ABSTRACT: Statistical and machine learning theory has developed several conditions ensuring that popular estimators such as the Lasso or the Dantzig selector perform well in high-dimensional sparse regression, including the restricted eigenvalue, compatibility, and [Formula: see text] sensitivity properties. However, some of the central aspects of these conditions are not well understood. For instance, it is unknown if these conditions can be checked efficiently on any given data set. This is problematic, because they are at the core of the theory of sparse regression. Here we provide a rigorous proof that these conditions are NP-hard to check. This shows that the conditions are computationally infeasible to verify, and raises some questions about their practical applications. However, by taking an average-case perspective instead of the worst-case view of NP-hardness, we show that a particular condition, [Formula: see text] sensitivity, has certain desirable properties. This condition is weaker and more general than the others. We show that it holds with high probability in models where the parent population is well behaved, and that it is robust to certain data processing steps. These results are desirable, as they provide guidance about when the condition, and more generally the theory of sparse regression, may be relevant in the analysis of high-dimensional correlated observational data.

SUBMITTER: Dobriban E

PROVIDER: S-EPMC4909155 | biostudies-literature | 2016 Mar

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

Regularity Properties for Sparse Regression.

Dobriban Edgar E Fan Jianqing J

Communications in mathematics and statistics 20160314 1

Statistical and machine learning theory has developed several conditions ensuring that popular estimators such as the Lasso or the Dantzig selector perform well in high-dimensional sparse regression, including the restricted eigenvalue, compatibility, and [Formula: see text] sensitivity properties. However, some of the central aspects of these conditions are not well understood. For instance, it is unknown if these conditions can be checked efficiently on any given data set. This is problematic, ...[more]

PMID: 27330929

Dataset Information

Regularity Properties for Sparse Regression.

Publications

Regularity Properties for Sparse Regression.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

Sparse Regression by Projection and Sparse Discriminant Analysis.
| S-EPMC4560121 | biostudies-literature

Sequential Co-Sparse Factor Regression.
| S-EPMC6190918 | biostudies-literature

Sparse relative risk regression models.
| S-EPMC7868056 | biostudies-literature

Bayesian sparse reduced rank multivariate regression.
| S-EPMC5628626 | biostudies-literature

Sparse Sliced Inverse Regression Via Lasso.
| S-EPMC7500493 | biostudies-literature

SPReM: Sparse Projection Regression Model For High-dimensional Linear Regression.
| S-EPMC4627720 | biostudies-literature

OKRidge: Scalable Optimal k-Sparse Ridge Regression.
| S-EPMC10950455 | biostudies-literature

Sparse kernel machine regression for ordinal outcomes.
| S-EPMC4609171 | biostudies-literature

Sparse Regression Incorporating Graphical Structure among Predictors.
| S-EPMC5830184 | biostudies-literature

Sparse regression and marginal testing using cluster prototypes.
| S-EPMC5006118 | biostudies-literature