Unknown

Dataset Information

0

A Robust Model-Free Feature Screening Method for Ultrahigh-Dimensional Data.


ABSTRACT: Feature screening plays an important role in dimension reduction for ultrahigh-dimensional data. In this paper, we introduce a new feature screening method and establish its sure independence screening property under the ultrahigh-dimensional setting. The proposed method works based on the nonparanormal transformation and Henze-Zirkler's test; that is, it first transforms the response variable and features to Gaussian random variables using the nonparanormal transformation and then tests the dependence between the response variable and features using the Henze-Zirkler's test. The proposed method enjoys at least two merits. First, it is model-free, which avoids the specification of a particular model structure. Second, it is condition-free, which does not require any extra conditions except for some regularity conditions for high-dimensional feature screening. The numerical results indicate that, compared to the existing methods, the proposed method is more robust to the data generated from heavy-tailed distributions and/or complex models with interaction variables. The proposed method is applied to screening of anticancer drug response genes.

SUBMITTER: Xue J 

PROVIDER: S-EPMC6284821 | biostudies-literature | 2017

REPOSITORIES: biostudies-literature

altmetric image

Publications

A Robust Model-Free Feature Screening Method for Ultrahigh-Dimensional Data.

Xue Jingnan J   Liang Faming F  

Journal of computational and graphical statistics : a joint publication of American Statistical Association, Institute of Mathematical Statistics, Interface Foundation of North America 20171009 4


Feature screening plays an important role in dimension reduction for ultrahigh-dimensional data. In this paper, we introduce a new feature screening method and establish its sure independence screening property under the ultrahigh-dimensional setting. The proposed method works based on the nonparanormal transformation and Henze-Zirkler's test; that is, it first transforms the response variable and features to Gaussian random variables using the nonparanormal transformation and then tests the dep  ...[more]

Similar Datasets

| S-EPMC4574103 | biostudies-literature
| S-EPMC5019497 | biostudies-literature
| S-EPMC4993699 | biostudies-literature
| S-EPMC7988961 | biostudies-literature
| S-EPMC5890472 | biostudies-literature
| S-EPMC3963210 | biostudies-literature
| S-EPMC4172658 | biostudies-literature
| S-EPMC6495533 | biostudies-literature
| S-EPMC5561268 | biostudies-literature
| S-EPMC4219371 | biostudies-literature