Unknown

Dataset Information

0

Fractional ridge regression: a fast, interpretable reparameterization of ridge regression.


ABSTRACT: Background: Ridge regression is a regularization technique that penalizes the L2-norm of the coefficients in linear regression. One of the challenges of using ridge regression is the need to set a hyperparameter (?) that controls the amount of regularization. Cross-validation is typically used to select the best ? from a set of candidates. However, efficient and appropriate selection of ? can be challenging. This becomes prohibitive when large amounts of data are analyzed. Because the selected ? depends on the scale of the data and correlations across predictors, it is also not straightforwardly interpretable.

Results: The present work addresses these challenges through a novel approach to ridge regression. We propose to reparameterize ridge regression in terms of the ratio ? between the L2-norms of the regularized and unregularized coefficients. We provide an algorithm that efficiently implements this approach, called fractional ridge regression, as well as open-source software implementations in Python and matlab (https://github.com/nrdg/fracridge). We show that the proposed method is fast and scalable for large-scale data problems. In brain imaging data, we demonstrate that this approach delivers results that are straightforward to interpret and compare across models and datasets.

Conclusion: Fractional ridge regression has several benefits: the solutions obtained for different ? are guaranteed to vary, guarding against wasted calculations; and automatically span the relevant range of regularization, avoiding the need for arduous manual exploration. These properties make fractional ridge regression particularly suitable for analysis of large complex datasets.

SUBMITTER: Rokem A 

PROVIDER: S-EPMC7702219 | biostudies-literature | 2020 Nov

REPOSITORIES: biostudies-literature

altmetric image

Publications

Fractional ridge regression: a fast, interpretable reparameterization of ridge regression.

Rokem Ariel A   Kay Kendrick K  

GigaScience 20201101 12


<h4>Background</h4>Ridge regression is a regularization technique that penalizes the L2-norm of the coefficients in linear regression. One of the challenges of using ridge regression is the need to set a hyperparameter (α) that controls the amount of regularization. Cross-validation is typically used to select the best α from a set of candidates. However, efficient and appropriate selection of α can be challenging. This becomes prohibitive when large amounts of data are analyzed. Because the sel  ...[more]

Similar Datasets

| S-EPMC8041068 | biostudies-literature
| S-EPMC4377081 | biostudies-other
| S-EPMC10950455 | biostudies-literature
| S-EPMC9807218 | biostudies-literature
| S-EPMC3429469 | biostudies-literature
| S-EPMC3224109 | biostudies-literature
| S-EPMC8031387 | biostudies-literature
| S-EPMC3228544 | biostudies-literature
| S-EPMC6430210 | biostudies-literature
| S-EPMC9243546 | biostudies-literature