Unknown

Dataset Information

0

Bias due to two-stage residual-outcome regression analysis in genetic association studies.


ABSTRACT: Association studies of risk factors and complex diseases require careful assessment of potential confounding factors. Two-stage regression analysis, sometimes referred to as residual- or adjusted-outcome analysis, has been increasingly used in association studies of single nucleotide polymorphisms (SNPs) and quantitative traits. In this analysis, first, a residual-outcome is calculated from a regression of the outcome variable on covariates and then the relationship between the adjusted-outcome and the SNP is evaluated by a simple linear regression of the adjusted-outcome on the SNP. In this article, we examine the performance of this two-stage analysis as compared with multiple linear regression (MLR) analysis. Our findings show that when a SNP and a covariate are correlated, the two-stage approach results in biased genotypic effect and loss of power. Bias is always toward the null and increases with the squared-correlation between the SNP and the covariate (). For example, for , 0.1, and 0.5, two-stage analysis results in, respectively, 0, 10, and 50% attenuation in the SNP effect. As expected, MLR was always unbiased. Since individual SNPs often show little or no correlation with covariates, a two-stage analysis is expected to perform as well as MLR in many genetic studies; however, it produces considerably different results from MLR and may lead to incorrect conclusions when independent variables are highly correlated. While a useful alternative to MLR under , the two -stage approach has serious limitations. Its use as a simple substitute for MLR should be avoided.

SUBMITTER: Demissie S 

PROVIDER: S-EPMC3201714 | biostudies-literature | 2011 Nov

REPOSITORIES: biostudies-literature

altmetric image

Publications

Bias due to two-stage residual-outcome regression analysis in genetic association studies.

Demissie Serkalem S   Cupples L Adrienne LA  

Genetic epidemiology 20110718 7


Association studies of risk factors and complex diseases require careful assessment of potential confounding factors. Two-stage regression analysis, sometimes referred to as residual- or adjusted-outcome analysis, has been increasingly used in association studies of single nucleotide polymorphisms (SNPs) and quantitative traits. In this analysis, first, a residual-outcome is calculated from a regression of the outcome variable on covariates and then the relationship between the adjusted-outcome  ...[more]

Similar Datasets

| S-EPMC4350584 | biostudies-literature
| S-EPMC3578817 | biostudies-literature
| S-EPMC6889171 | biostudies-literature
| S-EPMC3027114 | biostudies-literature
| S-EPMC6416071 | biostudies-literature
| S-EPMC7221498 | biostudies-literature
| S-EPMC7758823 | biostudies-literature
| S-EPMC3440236 | biostudies-literature
| S-EPMC6171163 | biostudies-literature
| S-EPMC9209005 | biostudies-literature