Dataset Information

A permutation procedure to correct for confounders in case-control studies, including tests of rare variation.

ABSTRACT: Many case-control tests of rare variation are implemented in statistical frameworks that make correction for confounders like population stratification difficult. Simple permutation of disease status is unacceptable for resolving this issue because the replicate data sets do not have the same confounding as the original data set. These limitations make it difficult to apply rare-variant tests to samples in which confounding most likely exists, e.g., samples collected from admixed populations. To enable the use of such rare-variant methods in structured samples, as well as to facilitate permutation tests for any situation in which case-control tests require adjustment for confounding covariates, we propose to establish the significance of a rare-variant test via a modified permutation procedure. Our procedure uses Fisher's noncentral hypergeometric distribution to generate permuted data sets with the same structure present in the actual data set such that inference is valid in the presence of confounding factors. We use simulated sequence data based on coalescent models to show that our permutation strategy corrects for confounding due to population stratification that, if ignored, would otherwise inflate the size of a rare-variant test. We further illustrate the approach by using sequence data from the Dallas Heart Study of energy metabolism traits. Researchers can implement our permutation approach by using the R package BiasedUrn.
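
The core idea in the abstract, drawing permuted case-control labels that preserve confounding, can be illustrated with a small sketch. The abstract points to the R package BiasedUrn for real use; the Python below is only an illustration and not the authors' code. It assumes that each subject already has an estimated odds of disease given the confounders (for example from a model fitted beforehand, which is an assumption here), and it draws a case set of fixed size with probability proportional to the product of the selected subjects' odds. This is the multivariate Fisher's noncentral hypergeometric distribution with one subject per category. The function name `sample_case_status` and the toy odds are hypothetical.

```python
import random


def sample_case_status(odds, n_cases, rng=random):
    """Draw a 0/1 case-status vector with exactly n_cases ones.

    The probability of a given case set is proportional to the product
    of the odds of the subjects in it (hypothetical helper, not from
    the paper or from BiasedUrn).
    """
    n = len(odds)
    if not 0 <= n_cases <= n:
        raise ValueError("n_cases must be between 0 and len(odds)")

    # tail[i][j] = sum, over all size-j subsets of subjects i..n-1,
    # of the product of their odds (elementary symmetric polynomials).
    tail = [[0.0] * (n_cases + 1) for _ in range(n + 1)]
    for i in range(n + 1):
        tail[i][0] = 1.0
    for i in range(n - 1, -1, -1):
        for j in range(1, n_cases + 1):
            tail[i][j] = tail[i + 1][j] + odds[i] * tail[i + 1][j - 1]

    # Walk through the subjects and decide sequentially whether each one
    # is a case, conditioning on how many cases remain to be assigned.
    status = [0] * n
    remaining = n_cases
    for i in range(n):
        if remaining == 0:
            break
        p_case = odds[i] * tail[i + 1][remaining - 1] / tail[i][remaining]
        if rng.random() < p_case:
            status[i] = 1
            remaining -= 1
    return status


# Toy example: two subjects with high disease odds, two with low odds.
toy_odds = [4.0, 4.0, 1.0, 1.0]
rng = random.Random(0)
replicates = [sample_case_status(toy_odds, 2, rng) for _ in range(2000)]
```

When all odds are equal, every case set is equally likely and the draw reduces to ordinary permutation of disease status. With unequal odds, subjects whose confounder values predict disease are labeled as cases more often, which is how the replicate data sets keep the structure of the original sample. The dynamic-programming table can overflow for large samples or extreme odds, so this sketch is suited to small illustrations only.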

SUBMITTER: Epstein MP

PROVIDER: S-EPMC3415546 | biostudies-literature | 2012 Aug

REPOSITORIES: biostudies-literature

Publications

A permutation procedure to correct for confounders in case-control studies, including tests of rare variation.

Epstein MP, Duncan R, Jiang Y, Conneely KN, Allen AS, Satten GA

American Journal of Human Genetics, 2012 Jul 19; issue 2

PMID: 22818855

Similar Datasets

Sampling strategies for rare variant tests in case-control studies. | S-EPMC3449077 | biostudies-literature

Integrating external controls in case-control studies improves power for rare-variant tests. | S-EPMC9393083 | biostudies-literature

Parallelized calculation of permutation tests. | S-EPMC8016463 | biostudies-literature

A permutation method for detecting trend correlations in rare variant association studies. | S-EPMC7044977 | biostudies-literature

Permutation Tests for General Dependent Truncation. | S-EPMC6317381 | biostudies-literature

Permutation-based statistical tests for multiple hypotheses. | S-EPMC2611984 | biostudies-literature

An exponential combination procedure for set-based association tests in sequencing studies. | S-EPMC3516612 | biostudies-literature

Pooled association tests for rare variants in exon-resequencing studies. | S-EPMC3032073 | biostudies-literature

Optimal tests for rare variant effects in sequencing association studies. | S-EPMC3440237 | biostudies-literature

Growth rate inhibition metrics correct for confounders in measuring sensitivity to cancer drugs. | S-EPMC4887336 | biostudies-literature