Unknown

Dataset Information

0

Optimal auxiliary-covariate-based two-phase sampling design for semiparametric efficient estimation of a mean or mean difference, with application to clinical trials.


ABSTRACT: To address the objective in a clinical trial to estimate the mean or mean difference of an expensive endpoint Y, one approach employs a two-phase sampling design, wherein inexpensive auxiliary variables W predictive of Y are measured in everyone, Y is measured in a random sample, and the semiparametric efficient estimator is applied. This approach is made efficient by specifying the phase two selection probabilities as optimal functions of the auxiliary variables and measurement costs. While this approach is familiar to survey samplers, it apparently has seldom been used in clinical trials, and several novel results practicable for clinical trials are developed. We perform simulations to identify settings where the optimal approach significantly improves efficiency compared to approaches in current practice. We provide proofs and R code. The optimality results are developed to design an HIV vaccine trial, with objective to compare the mean 'importance-weighted' breadth (Y) of the T-cell response between randomized vaccine groups. The trial collects an auxiliary response (W) highly predictive of Y and measures Y in the optimal subset. We show that the optimal design-estimation approach can confer anywhere between absent and large efficiency gain (up to 24 % in the examples) compared to the approach with the same efficient estimator but simple random sampling, where greater variability in the cost-standardized conditional variance of Y given W yields greater efficiency gains. Accurate estimation of E[Y?|?W] is important for realizing the efficiency gain, which is aided by an ample phase two sample and by using a robust fitting method.

SUBMITTER: Gilbert PB 

PROVIDER: S-EPMC4021041 | biostudies-literature | 2014 Mar

REPOSITORIES: biostudies-literature

altmetric image

Publications

Optimal auxiliary-covariate-based two-phase sampling design for semiparametric efficient estimation of a mean or mean difference, with application to clinical trials.

Gilbert Peter B PB   Yu Xuesong X   Rotnitzky Andrea A  

Statistics in medicine 20131009 6


To address the objective in a clinical trial to estimate the mean or mean difference of an expensive endpoint Y, one approach employs a two-phase sampling design, wherein inexpensive auxiliary variables W predictive of Y are measured in everyone, Y is measured in a random sample, and the semiparametric efficient estimator is applied. This approach is made efficient by specifying the phase two selection probabilities as optimal functions of the auxiliary variables and measurement costs. While thi  ...[more]

Similar Datasets

| S-EPMC2892338 | biostudies-literature
| S-EPMC3944970 | biostudies-literature
| S-EPMC6916299 | biostudies-literature
| S-EPMC3922234 | biostudies-literature
| S-EPMC7384929 | biostudies-literature
| S-EPMC8172256 | biostudies-literature
| S-EPMC3855673 | biostudies-literature
| S-EPMC4480066 | biostudies-literature
| S-EPMC3596883 | biostudies-literature
| S-EPMC3114654 | biostudies-literature