Dataset Information

The choice of null distributions for detecting gene-gene interactions in genome-wide association studies.

ABSTRACT:

Background

In genome-wide association studies (GWAS), the number of single-nucleotide polymorphisms (SNPs) typically ranges between 500,000 and 1,000,000. Accordingly, detecting gene-gene interactions in GWAS is computationally challenging because it involves hundreds of billions of SNP pairs. Stage-wise strategies are often used to overcome the computational difficulty. In the first stage, fast screening methods (e.g. Tuning ReliefF) are applied to reduce the whole SNP set to a small subset. In the second stage, sophisticated modeling methods (e.g., multifactor-dimensionality reduction (MDR)) are applied to the subset of SNPs to identify interesting interaction models and the corresponding interaction patterns. In the third stage, the significance of the identified interaction patterns is evaluated by hypothesis testing.

Results

In this paper, we show that this stage-wise strategy could be problematic in controlling the false positive rate if the null distribution is not appropriately chosen. This is because screening and modeling may change the null distribution used in hypothesis testing. In our simulation study, we use some popular screening methods and the popular modeling method MDR as examples to show the effect of the inappropriate choice of null distributions. To choose appropriate null distributions, we suggest to use the permutation test or testing on the independent data set. We demonstrate their performance using synthetic data and a real genome wide data set from an Aged-related Macular Degeneration (AMD) study.

Conclusions

The permutation test or testing on the independent data set can help choosing appropriate null distributions in hypothesis testing, which provides more reliable results in practice.

SUBMITTER: Yang C

PROVIDER: S-EPMC3044281 | biostudies-literature | 2011 Feb

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

The choice of null distributions for detecting gene-gene interactions in genome-wide association studies.

Yang Can C Wan Xiang X He Zengyou Z Yang Qiang Q Xue Hong H Yu Weichuan W

BMC bioinformatics 20110215

<h4>Background</h4>In genome-wide association studies (GWAS), the number of single-nucleotide polymorphisms (SNPs) typically ranges between 500,000 and 1,000,000. Accordingly, detecting gene-gene interactions in GWAS is computationally challenging because it involves hundreds of billions of SNP pairs. Stage-wise strategies are often used to overcome the computational difficulty. In the first stage, fast screening methods (e.g. Tuning ReliefF) are applied to reduce the whole SNP set to a small su ...[more]

PMID: 21342556

Dataset Information

The choice of null distributions for detecting gene-gene interactions in genome-wide association studies.

Background

Results

Conclusions

Publications

The choice of null distributions for detecting gene-gene interactions in genome-wide association studies.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

A FAST ALGORITHM FOR DETECTING GENE-GENE INTERACTIONS IN GENOME-WIDE ASSOCIATION STUDIES.
| S-EPMC4595934 | biostudies-literature

Detecting gene-environment interactions in genome-wide association data.
| S-EPMC2924567 | biostudies-literature

Testing gene-gene interactions in genome wide association studies.
| S-EPMC4487553 | biostudies-literature

RAPID detection of gene-gene interactions in genome-wide association studies.
| S-EPMC3493125 | biostudies-literature

bNEAT: a Bayesian network method for detecting epistatic interactions in genome-wide association studies.
| S-EPMC3194240 | biostudies-literature

BOOST: A fast approach to detecting gene-gene interactions in genome-wide case-control studies.
| S-EPMC2933337 | biostudies-literature

Detecting signals in pharmacogenomic genome-wide association studies.
| S-EPMC4085158 | biostudies-literature

GBOOST: a GPU-based tool for detecting gene-gene interactions in genome-wide case control studies.
| S-EPMC3105448 | biostudies-literature

Detecting Gene-Environment Interactions for a Quantitative Trait in a Genome-Wide Association Study.
| S-EPMC5108681 | biostudies-literature

Gene-Based Testing of Interactions Using XGBoost in Genome-Wide Association Studies.
| S-EPMC8716787 | biostudies-literature