Unknown

Dataset Information

0

A Generic Sure Independence Screening Procedure.


ABSTRACT: Extracting important features from ultra-high dimensional data is one of the primary tasks in statistical learning, information theory, precision medicine and biological discovery. Many of the sure independent screening methods developed to meet these needs are suitable for special models under some assumptions. With the availability of more data types and possible models, a model-free generic screening procedure with fewer and less restrictive assumptions is desirable. In this paper, we propose a generic nonparametric sure independence screening procedure, called BCor-SIS, on the basis of a recently developed universal dependence measure: Ball correlation. We show that the proposed procedure has strong screening consistency even when the dimensionality is an exponential order of the sample size without imposing sub-exponential moment assumptions on the data. We investigate the flexibility of this procedure by considering three commonly encountered challenging settings in biological discovery or precision medicine: iterative BCor-SIS, interaction pursuit, and survival outcomes. We use simulation studies and real data analyses to illustrate the versatility and practicability of our BCor-SIS method.

SUBMITTER: Pan W 

PROVIDER: S-EPMC6831100 | biostudies-literature | 2019

REPOSITORIES: biostudies-literature

altmetric image

Publications

A Generic Sure Independence Screening Procedure.

Pan Wenliang W   Wang Xueqin X   Xiao Weinan W   Zhu Hongtu H  

Journal of the American Statistical Association 20180806 526


Extracting important features from ultra-high dimensional data is one of the primary tasks in statistical learning, information theory, precision medicine and biological discovery. Many of the sure independent screening methods developed to meet these needs are suitable for special models under some assumptions. With the availability of more data types and possible models, a model-free generic screening procedure with fewer and less restrictive assumptions is desirable. In this paper, we propose  ...[more]

Similar Datasets

| S-EPMC5367860 | biostudies-literature
| S-EPMC3887322 | biostudies-literature
| S-EPMC4368776 | biostudies-literature
| S-EPMC3293491 | biostudies-literature
| S-EPMC5308866 | biostudies-literature
| S-EPMC4142445 | biostudies-literature
| S-EPMC7507262 | biostudies-literature
| S-EPMC4318124 | biostudies-literature
| S-EPMC4610706 | biostudies-literature
| S-EPMC4705515 | biostudies-literature