Unknown

Dataset Information

0

Estimation of the optimal surrogate based on a randomized trial.


ABSTRACT: A common scientific problem is to determine a surrogate outcome for a long-term outcome so that future randomized studies can restrict themselves to only collecting the surrogate outcome. We consider the setting that we observe n independent and identically distributed observations of a random variable consisting of baseline covariates, a treatment, a vector of candidate surrogate outcomes at an intermediate time point, and the final outcome of interest at a final time point. We assume the treatment is randomized, conditional on the baseline covariates. The goal is to use these data to learn a most-promising surrogate for use in future trials for inference about a mean contrast treatment effect on the final outcome. We define an optimal surrogate for the current study as the function of the data generating distribution collected by the intermediate time point that satisfies the Prentice definition of a valid surrogate endpoint and that optimally predicts the final outcome: this optimal surrogate is an unknown parameter. We show that this optimal surrogate is a conditional mean and present super-learner and targeted super-learner based estimators, whose predicted outcomes are used as the surrogate in applications. We demonstrate a number of desirable properties of this optimal surrogate and its estimators, and study the methodology in simulations and an application to dengue vaccine efficacy trials.

SUBMITTER: Price BL 

PROVIDER: S-EPMC6393111 | biostudies-literature | 2018 Dec

REPOSITORIES: biostudies-literature

altmetric image

Publications

Estimation of the optimal surrogate based on a randomized trial.

Price Brenda L BL   Gilbert Peter B PB   van der Laan Mark J MJ  

Biometrics 20180427 4


A common scientific problem is to determine a surrogate outcome for a long-term outcome so that future randomized studies can restrict themselves to only collecting the surrogate outcome. We consider the setting that we observe n independent and identically distributed observations of a random variable consisting of baseline covariates, a treatment, a vector of candidate surrogate outcomes at an intermediate time point, and the final outcome of interest at a final time point. We assume the treat  ...[more]

Similar Datasets

| S-EPMC4836946 | biostudies-literature
| S-EPMC6649632 | biostudies-literature
| S-EPMC7890479 | biostudies-literature
| S-EPMC6117357 | biostudies-literature
| S-EPMC8551833 | biostudies-literature
| S-EPMC8139150 | biostudies-literature
| S-EPMC8328375 | biostudies-literature
| S-EPMC2679823 | biostudies-literature
| S-EPMC6466704 | biostudies-literature
| S-EPMC7776804 | biostudies-literature