Dataset Information

CONFOUNDER ADJUSTMENT IN MULTIPLE HYPOTHESIS TESTING.

ABSTRACT: We consider large-scale studies in which thousands of significance tests are performed simultaneously. In some of these studies, the multiple testing procedure can be severely biased by latent confounding factors such as batch effects and unmeasured covariates that correlate with both primary variable(s) of interest (e.g., treatment variable, phenotype) and the outcome. Over the past decade, many statistical methods have been proposed to adjust for the confounders in hypothesis testing. We unify these methods in the same framework, generalize them to include multiple primary variables and multiple nuisance variables, and analyze their statistical properties. In particular, we provide theoretical guarantees for RUV-4 [Gagnon-Bartsch, Jacob and Speed (2013)] and LEAPP [Ann. Appl. Stat. 6 (2012) 1664-1688], which correspond to two different identification conditions in the framework: the first requires a set of "negative controls" that are known a priori to follow the null distribution; the second requires the true nonnulls to be sparse. Two different estimators which are based on RUV-4 and LEAPP are then applied to these two scenarios. We show that if the confounding factors are strong, the resulting estimators can be asymptotically as powerful as the oracle estimator which observes the latent confounding factors. For hypothesis testing, we show the asymptotic z-tests based on the estimators can control the type I error. Numerical experiments show that the false discovery rate is also controlled by the Benjamini-Hochberg procedure when the sample size is reasonably large.

SUBMITTER: Wang J

PROVIDER: S-EPMC6706069 | biostudies-literature | 2017 Oct

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

CONFOUNDER ADJUSTMENT IN MULTIPLE HYPOTHESIS TESTING.

Wang Jingshu J Zhao Qingyuan Q Hastie Trevor T Owen Art B AB

Annals of statistics 20171031 5

We consider large-scale studies in which thousands of significance tests are performed simultaneously. In some of these studies, the multiple testing procedure can be severely biased by latent confounding factors such as batch effects and unmeasured covariates that correlate with both primary variable(s) of interest (e.g., treatment variable, phenotype) and the outcome. Over the past decade, many statistical methods have been proposed to adjust for the confounders in hypothesis testing. We unify ...[more]

PMID: 31439967

Similar Datasets

Project description:BackgroundConfounder adjustment is critical for accurate causal inference in observational studies. However, the appropriateness of methods for confounder adjustment in studies investigating multiple risk factors, where the factors are not simply mutually confounded, is often overlooked. This study aims to summarise the methods for confounder adjustment and the related issues in studies investigating multiple risk factors.MethodsA methodological study was performed. We searched PubMed from January 2018 to March 2023 to identify cohort and case-control studies investigating multiple risk factors for three chronic diseases (cardiovascular disease, diabetes and dementia). Study selection and data extraction were conducted independently by two reviewers. The study objectives were grouped into two categories: widely exploring potential risk factors and examining specific risk factors. The methods for confounder adjustment were classified based on a summarisation of the included studies, identifying six categories: (1) each risk factor was adjusted for potential confounders separately (the recommended method); (2) all risk factors were mutually adjusted (i.e. including all factors in a multivariable model); (3) all risk factors were adjusted for the same confounders separately; (4) all risk factors were adjusted for the same confounders with some factors being mutually adjusted; (5) all risk factors were adjusted for the same confounders with mutual adjustment among them being unclear; and (6) unable to judge. All data were descriptively analysed.ResultsA total of 162 studies were included, with 88 (54.3%) exploring potential risk factors and 74 (45.7%) examining specific risk factors. The current status of confounder adjustment was unsatisfactory: only ten studies (6.2%) used the recommended method, all of which aimed at examining several specific risk factors; in contrast, mutual adjustment was adopted in over 70% of the studies. The remaining studies either adjusted for the same confounders across all risk factors, or unable to judge.ConclusionsThere is substantial variation in the methods for confounder adjustment among studies investigating multiple risk factors. Mutual adjustment was the most commonly adopted method, which might lead to overadjustment bias and misleading effect estimates. Future research should avoid indiscriminately including all risk factors in a multivariable model to prevent inappropriate adjustment.

Dataset Information

CONFOUNDER ADJUSTMENT IN MULTIPLE HYPOTHESIS TESTING.

Publications

CONFOUNDER ADJUSTMENT IN MULTIPLE HYPOTHESIS TESTING.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets