Dataset Information

Kernel Entropy Component Analysis with Nongreedy L1-Norm Maximization.


ABSTRACT: Kernel entropy component analysis (KECA) is a recently proposed dimensionality reduction (DR) method that has shown superiority in many pattern analysis problems previously solved by principal component analysis (PCA). The optimized KECA (OKECA) is a state-of-the-art variant of KECA that returns projections retaining more expressive power than KECA. However, OKECA is sensitive to outliers and has been criticized for its high computational complexity, owing to its inherent reliance on the L2-norm. To handle these two problems, we develop a new extension to KECA, namely KECA-L1, for DR or feature extraction. KECA-L1 aims to find a more robust kernel decomposition matrix such that the extracted features retain as much information potential as possible, as measured by the L1-norm. Accordingly, we design a nongreedy iterative algorithm that converges much faster than OKECA's. Moreover, a general semisupervised classifier is developed for KECA-based methods and applied to data classification. Extensive experiments on data classification and software defect prediction demonstrate that our new method is superior to most existing KECA- and PCA-based approaches. The code has also been made publicly available.
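
The abstract names a nongreedy iterative L1-norm maximization but gives no details. As a rough illustration only, the sketch below shows a generic nongreedy scheme of that kind: it seeks an orthonormal W that maximizes the sum of absolute projections of the columns of X, alternating a sign step with an SVD step. This is not the authors' released code. The function names, the random initialization, the stopping rule, and the use of a plain data matrix X (rather than features derived from a kernel matrix, as KECA would presumably require) are all assumptions made for this example.

```python
import numpy as np


def l1_objective(W, X):
    """Sum of absolute projections, sum_i ||W^T x_i||_1 (columns of X are samples)."""
    return np.abs(W.T @ X).sum()


def nongreedy_l1_max(X, m, n_iter=100, tol=1e-8, seed=0):
    """Hypothetical sketch: find W (d x m, orthonormal columns) that locally
    maximizes the L1 objective, updating all m directions at once.

    X is a (d, n) array with one sample per column; requires m <= d.
    """
    rng = np.random.default_rng(seed)
    d, _ = X.shape
    # Assumed initialization: random orthonormal basis from a QR factorization.
    W, _ = np.linalg.qr(rng.standard_normal((d, m)))
    prev = l1_objective(W, X)
    history = [prev]
    for _ in range(n_iter):
        # Sign step: fix the sign pattern of the current projections.
        A = np.sign(W.T @ X)
        A[A == 0] = 1.0
        # SVD step: the orthonormal W maximizing trace(W^T M) is U V^T.
        M = X @ A.T
        U, _, Vt = np.linalg.svd(M, full_matrices=False)
        W = U @ Vt
        cur = l1_objective(W, X)
        history.append(cur)
        # Assumed stopping rule: relative improvement below tol.
        if cur - prev <= tol * max(prev, 1.0):
            break
        prev = cur
    return W, history


if __name__ == "__main__":
    rng = np.random.default_rng(1)
    X = rng.standard_normal((5, 50))
    W, history = nongreedy_l1_max(X, m=2)
    print("iterations:", len(history) - 1)
    print("orthonormal:", np.allclose(W.T @ W, np.eye(2)))
```

Each pass cannot decrease the objective, because the new W maximizes a linear lower bound that is tight at the previous W. Updating all m directions jointly, instead of extracting them one at a time, is what "nongreedy" refers to in this sketch.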

SUBMITTER: Ji H 

PROVIDER: S-EPMC6204191 | biostudies-literature | 2018

REPOSITORIES: biostudies-literature

Publications

Kernel Entropy Component Analysis with Nongreedy L1-Norm Maximization.

Ji Haijin, Huang Song

Computational Intelligence and Neuroscience, 2018-10-14


Similar Datasets

| S-EPMC3746759 | biostudies-literature
| S-EPMC3601508 | biostudies-literature
| S-EPMC4559466 | biostudies-literature
| S-EPMC7498035 | biostudies-literature
| S-EPMC2906488 | biostudies-literature
| S-EPMC8154256 | biostudies-literature
| S-EPMC3176196 | biostudies-literature
| S-EPMC1994960 | biostudies-literature