Unknown

Dataset Information

0

A Modified Random Survival Forests Algorithm for High Dimensional Predictors and Self-Reported Outcomes.


ABSTRACT: We present an ensemble tree-based algorithm for variable selection in high dimensional datasets, in settings where a time-to-event outcome is observed with error. The proposed methods are motivated by self-reported outcomes collected in large-scale epidemiologic studies, such as the Women's Health Initiative. The proposed methods equally apply to imperfect outcomes that arise in other settings such as data extracted from electronic medical records. To evaluate the performance of our proposed algorithm, we present results from simulation studies, considering both continuous and categorical covariates. We illustrate this approach to discover single nucleotide polymorphisms that are associated with incident Type II diabetes in the Women's Health Initiative. A freely available R package icRSF (R Core Team, 2018; Xu et al., 2018) has been developed to implement the proposed methods.

SUBMITTER: Xu H 

PROVIDER: S-EPMC6369914 | biostudies-literature | 2018

REPOSITORIES: biostudies-literature

altmetric image

Publications

A Modified Random Survival Forests Algorithm for High Dimensional Predictors and Self-Reported Outcomes.

Xu Hui H   Gu Xiangdong X   Tadesse Mahlet G MG   Balasubramanian Raji R  

Journal of computational and graphical statistics : a joint publication of American Statistical Association, Institute of Mathematical Statistics, Interface Foundation of North America 20180820 4


We present an ensemble tree-based algorithm for variable selection in high dimensional datasets, in settings where a time-to-event outcome is observed with error. The proposed methods are motivated by self-reported outcomes collected in large-scale epidemiologic studies, such as the Women's Health Initiative. The proposed methods equally apply to imperfect outcomes that arise in other settings such as data extracted from electronic medical records. To evaluate the performance of our proposed alg  ...[more]

Similar Datasets

| S-EPMC7487595 | biostudies-literature
| S-EPMC3495190 | biostudies-literature
| S-EPMC2889677 | biostudies-literature
| S-EPMC6368971 | biostudies-literature
| S-EPMC2804301 | biostudies-literature
| S-EPMC3530909 | biostudies-literature
| S-EPMC4173102 | biostudies-literature
| S-EPMC8716055 | biostudies-literature
| S-EPMC6959482 | biostudies-literature
| S-EPMC11343578 | biostudies-literature