Dataset Information

A general regression framework for a secondary outcome in case-control studies.


ABSTRACT: Modern case-control studies typically involve the collection of data on a large number of outcomes, often at considerable logistical and monetary expense. These data are of potentially great value to subsequent researchers, who, although not necessarily concerned with the disease that defined the case series in the original study, may want to use the available information for a regression analysis involving a secondary outcome. Because cases and controls are selected with unequal probability, regression analysis involving a secondary outcome generally must acknowledge the sampling design. In this paper, the author presents a new framework for the analysis of secondary outcomes in case-control studies. The approach is based on a careful re-parameterization of the conditional model for the secondary outcome given the case-control outcome and regression covariates, in terms of (a) the population regression of interest of the secondary outcome given covariates and (b) the population regression of the case-control outcome on covariates. The error distribution for the secondary outcome given covariates and case-control status is otherwise unrestricted. For a continuous outcome, the approach sometimes reduces to extending model (a) by including a residual of (b) as a covariate. However, the framework is general in the sense that models (a) and (b) can take any functional form, and the methodology allows for an identity, log or logit link function for model (a).
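The continuous-outcome special case mentioned in the abstract (extending model (a) with a residual of model (b) as a covariate) can be illustrated with a small simulation. The following is a minimal sketch, not the author's estimator: the data-generating model, all parameter values, the helper names `fit_logistic` and `simulate_and_fit`, the constant coefficient `gamma` on the residual, and the assumption that the control sampling fraction is known (so the logistic intercept can be shifted back to the population scale) are choices made here for illustration only.

```python
import numpy as np

def expit(z):
    return 1.0 / (1.0 + np.exp(-z))

def fit_logistic(X, d, n_iter=25):
    """Plain Newton-Raphson logistic regression; X must include an intercept column."""
    beta = np.zeros(X.shape[1])
    for _ in range(n_iter):
        p = expit(X @ beta)
        grad = X.T @ (d - p)
        hess = (X * (p * (1 - p))[:, None]).T @ X
        beta = beta + np.linalg.solve(hess, grad)
    return beta

def simulate_and_fit(seed=0, n_pop=200_000, gamma=2.0):
    rng = np.random.default_rng(seed)

    # Hypothetical population: model (b) is logistic in x, and model (a) is
    # E[Y | X] = 0 + 1 * x (identity link), so the target slope is 1.0.
    x = rng.normal(size=n_pop)
    p_pop = expit(-3.0 + 1.0 * x)
    d = rng.binomial(1, p_pop)
    # Y depends on case status only through the residual d - P(D = 1 | X),
    # which has mean zero given x, so the population regression is unchanged.
    y = 1.0 * x + gamma * (d - p_pop) + rng.normal(size=n_pop)

    # Case-control sampling: keep every case, draw an equal number of controls.
    case_idx = np.flatnonzero(d == 1)
    ctrl_pool = np.flatnonzero(d == 0)
    ctrl_idx = rng.choice(ctrl_pool, size=case_idx.size, replace=False)
    ctrl_fraction = case_idx.size / ctrl_pool.size  # assumed known to the analyst
    idx = np.concatenate([case_idx, ctrl_idx])
    xs, ds, ys = x[idx], d[idx], y[idx]

    design = np.column_stack([np.ones_like(xs), xs])

    # Naive fit: ordinary least squares that ignores the sampling design.
    naive = np.linalg.lstsq(design, ys, rcond=None)[0]

    # Model (b): fit in the sample, then shift the intercept by
    # log(control sampling fraction) to return to the population scale.
    alpha = fit_logistic(design, ds)
    alpha[0] += np.log(ctrl_fraction)
    residual = ds - expit(design @ alpha)

    # Model (a) extended with the residual of model (b) as a covariate.
    adjusted = np.linalg.lstsq(
        np.column_stack([design, residual]), ys, rcond=None
    )[0]
    return naive[1], adjusted[1]

naive_slope, adjusted_slope = simulate_and_fit()
print(f"naive slope: {naive_slope:.3f}, adjusted slope: {adjusted_slope:.3f}")
```

In this simulated setting the naive slope drifts away from the population value of 1.0 because cases are oversampled, while the residual-adjusted fit should land close to it. The sketch covers only the identity link with a constant residual coefficient; the log and logit links mentioned in the abstract are not shown.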

SUBMITTER: Tchetgen Tchetgen EJ 

PROVIDER: S-EPMC3983430 | biostudies-literature | 2014 Jan

REPOSITORIES: biostudies-literature

Publications

A general regression framework for a secondary outcome in case-control studies.

Tchetgen Tchetgen, Eric J

Biostatistics (Oxford, England) 20131022 1


Similar Datasets

| S-EPMC5477998 | biostudies-literature
| S-EPMC3376500 | biostudies-literature
| S-EPMC5569006 | biostudies-literature
| S-EPMC6347118 | biostudies-literature
| S-EPMC1304888 | biostudies-literature
| S-EPMC3639015 | biostudies-literature
| S-EPMC3881430 | biostudies-literature
| S-EPMC4731052 | biostudies-literature
| S-EPMC3294270 | biostudies-literature
| S-EPMC2909900 | biostudies-literature