Ontology highlight
ABSTRACT:
SUBMITTER: Hou J
PROVIDER: S-EPMC10947223 | biostudies-literature | 2023 Jan-Dec
REPOSITORIES: biostudies-literature
Hou Jue J Guo Zijian Z Cai Tianxi T
Journal of machine learning research : JMLR 20230101
Risk modeling with electronic health records (EHR) data is challenging due to no direct observations of the disease outcome and the high-dimensional predictors. In this paper, we develop a surrogate assisted semi-supervised learning approach, leveraging small labeled data with annotated outcomes and extensive unlabeled data of outcome surrogates and high-dimensional predictors. We propose to impute the unobserved outcomes by constructing a sparse imputation model with outcome surrogates and high ...[more]