Unknown

Dataset Information

0

SURPRISES IN HIGH-DIMENSIONAL RIDGELESS LEAST SQUARES INTERPOLATION.


ABSTRACT: Interpolators-estimators that achieve zero training error-have attracted growing attention in machine learning, mainly because state-of-the art neural networks appear to be models of this type. In this paper, we study minimum 2 norm ("ridgeless") interpolation least squares regression, focusing on the high-dimensional regime in which the number of unknown parameters p is of the same order as the number of samples n. We consider two different models for the feature distribution: a linear model, where the feature vectors xi∈ℝp are obtained by applying a linear transform to a vector of i.i.d. entries, x i = Σ1/2 z i (with zi∈ℝp ); and a nonlinear model, where the feature vectors are obtained by passing the input through a random one-layer neural network, xi = φ(Wz i ) (with zi∈ℝd , W∈ℝp×d a matrix of i.i.d. entries, and φ an activation function acting componentwise on Wz i ). We recover-in a precise quantitative way-several phenomena that have been observed in large-scale neural networks and kernel machines, including the "double descent" behavior of the prediction risk, and the potential benefits of overparametrization.

SUBMITTER: Hastie T 

PROVIDER: S-EPMC9481183 | biostudies-literature | 2022 Apr

REPOSITORIES: biostudies-literature

altmetric image

Publications

SURPRISES IN HIGH-DIMENSIONAL RIDGELESS LEAST SQUARES INTERPOLATION.

Hastie Trevor T   Montanari Andrea A   Rosset Saharon S   Tibshirani Ryan J RJ  

Annals of statistics 20220407 2


Interpolators-estimators that achieve zero training error-have attracted growing attention in machine learning, mainly because state-of-the art neural networks appear to be models of this type. In this paper, we study minimum <i>ℓ</i> <sub>2</sub> norm ("ridgeless") interpolation least squares regression, focusing on the high-dimensional regime in which the number of unknown parameters <i>p</i> is of the same order as the number of samples <i>n</i>. We consider two different models for the featu  ...[more]

Similar Datasets

| S-EPMC2861314 | biostudies-literature
| S-EPMC6953292 | biostudies-literature
| S-EPMC5338488 | biostudies-literature
| S-EPMC33532 | biostudies-literature
| S-EPMC6563228 | biostudies-literature
| S-EPMC6402661 | biostudies-literature
| S-EPMC8110750 | biostudies-literature
| S-EPMC4378190 | biostudies-literature
| S-EPMC3491686 | biostudies-literature
| S-EPMC4762065 | biostudies-literature