Dataset Information

Fractional ridge regression: a fast, interpretable reparameterization of ridge regression.

ABSTRACT: Ridge regression is a regularization technique that penalizes the L2-norm of the coefficients in linear regression. One of the challenges of using ridge regression is the need to set a hyperparameter (α) that controls the amount of regularization. Cross-validation is typically used to select the best α from a set of candidates. However, efficient and appropriate selection of α can be challenging. This becomes prohibitive when large amounts of data are analyzed. Because the selected α depends on the scale of the data and correlations across predictors, it is also not straightforwardly interpretable. The present work addresses these challenges through a novel approach to ridge regression. We propose to reparameterize ridge regression in terms of the ratio γ between the L2-norms of the regularized and unregularized coefficients. We provide an algorithm that efficiently implements this approach, called fractional ridge regression, as well as open-source software implementations in Python and matlab (https://github.com/nrdg/fracridge). We show that the proposed method is fast and scalable for large-scale data problems. In brain imaging data, we demonstrate that this approach delivers results that are straightforward to interpret and compare across models and datasets. Fractional ridge regression has several benefits: the solutions obtained for different γ are guaranteed to vary, guarding against wasted calculations; and automatically span the relevant range of regularization, avoiding the need for arduous manual exploration. These properties make fractional ridge regression particularly suitable for analysis of large complex datasets.

SUBMITTER: Rokem A

PROVIDER: S-EPMC7702219 | biostudies-literature | 2020 Nov

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

Fractional ridge regression: a fast, interpretable reparameterization of ridge regression.

Rokem Ariel A Kay Kendrick K

GigaScience 20201101 12

<h4>Background</h4>Ridge regression is a regularization technique that penalizes the L2-norm of the coefficients in linear regression. One of the challenges of using ridge regression is the need to set a hyperparameter (α) that controls the amount of regularization. Cross-validation is typically used to select the best α from a set of candidates. However, efficient and appropriate selection of α can be challenging. This becomes prohibitive when large amounts of data are analyzed. Because the sel ...[more]

PMID: 33252656

Dataset Information

Fractional ridge regression: a fast, interpretable reparameterization of ridge regression.

Publications

Fractional ridge regression: a fast, interpretable reparameterization of ridge regression.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

A Fast Multi-Locus Ridge Regression Algorithm for High-Dimensional Genome-Wide Association Studies.
| S-EPMC8041068 | biostudies-literature

Ridge regression in prediction problems: automatic choice of the ridge parameter.
| S-EPMC4377081 | biostudies-other

OKRidge: Scalable Optimal k-Sparse Ridge Regression.
| S-EPMC10950455 | biostudies-literature

Feature-space selection with banded ridge regression.
| S-EPMC9807218 | biostudies-literature

Adaptive ridge regression for rare variant detection.
| S-EPMC3429469 | biostudies-literature

Multilocus association mapping using generalized ridge logistic regression.
| S-EPMC3224109 | biostudies-literature

Ridge regression and its applications in genetic studies.
| S-EPMC8031387 | biostudies-literature

Significance testing in ridge regression for genetic data.
| S-EPMC3228544 | biostudies-literature

Broken adaptive ridge regression and its asymptotic properties.
| S-EPMC6430210 | biostudies-literature

Fast construction of interpretable whole-brain decoders.
| S-EPMC9243546 | biostudies-literature