Unknown

Dataset Information

0

Penalized classification using Fisher's linear discriminant.


ABSTRACT: We consider the supervised classification setting, in which the data consist of p features measured on n observations, each of which belongs to one of K classes. Linear discriminant analysis (LDA) is a classical method for this problem. However, in the high-dimensional setting where p ? n, LDA is not appropriate for two reasons. First, the standard estimate for the within-class covariance matrix is singular, and so the usual discriminant rule cannot be applied. Second, when p is large, it is difficult to interpret the classification rule obtained from LDA, since it involves all p features. We propose penalized LDA, a general approach for penalizing the discriminant vectors in Fisher's discriminant problem in a way that leads to greater interpretability. The discriminant problem is not convex, so we use a minorization-maximization approach in order to efficiently optimize it when convex penalties are applied to the discriminant vectors. In particular, we consider the use of L(1) and fused lasso penalties. Our proposal is equivalent to recasting Fisher's discriminant problem as a biconvex problem. We evaluate the performances of the resulting methods on a simulation study, and on three gene expression data sets. We also survey past methods for extending LDA to the high-dimensional setting, and explore their relationships with our proposal.

SUBMITTER: Witten DM 

PROVIDER: S-EPMC3272679 | biostudies-literature | 2011 Nov

REPOSITORIES: biostudies-literature

altmetric image

Publications

Penalized classification using Fisher's linear discriminant.

Witten Daniela M DM   Tibshirani Robert R  

Journal of the Royal Statistical Society. Series B, Statistical methodology 20111101 5


We consider the supervised classification setting, in which the data consist of p features measured on n observations, each of which belongs to one of K classes. Linear discriminant analysis (LDA) is a classical method for this problem. However, in the high-dimensional setting where p ≫ n, LDA is not appropriate for two reasons. First, the standard estimate for the within-class covariance matrix is singular, and so the usual discriminant rule cannot be applied. Second, when p is large, it is dif  ...[more]

Similar Datasets

| S-EPMC5655752 | biostudies-literature
| S-EPMC4143686 | biostudies-literature
| S-EPMC7505231 | biostudies-literature
| S-EPMC6810714 | biostudies-literature
| S-EPMC3770000 | biostudies-literature