Dataset Information

Fast and Accurate Approximation to Significance Tests in Genome-Wide Association Studies.

ABSTRACT: Genome-wide association studies commonly involve simultaneous tests of millions of single nucleotide polymorphisms (SNPs) for disease association. The SNPs in nearby genomic regions, however, are often highly correlated due to linkage disequilibrium (LD, a genetic term for correlation). Simple Bonferroni correction for multiple comparisons is therefore too conservative. Permutation tests, which are often employed in practice, are both computationally expensive for genome-wide studies and limited in scope. We present an accurate and computationally efficient method, based on Poisson de-clumping heuristics, for approximating genome-wide significance of SNP associations. Compared with permutation tests and other multiple comparison adjustment approaches, our method computes the most accurate and robust p-value adjustments for millions of correlated comparisons within seconds. We demonstrate analytically that the accuracy and the efficiency of our method are nearly independent of the sample size, the number of SNPs, and the scale of p-values to be adjusted. In addition, our method can be easily adapted to estimate the false discovery rate. When applied to genome-wide SNP datasets, we observed highly variable p-value adjustment results evaluated from different genomic regions. The variation in adjustments along the genome, however, is well conserved between the European and the African populations. The p-value adjustments are significantly correlated with LD among SNPs, recombination rates, and SNP densities. Given the large variability of sequence features in the genome, we further discuss a novel approach of using SNP-specific (local) thresholds to detect genome-wide significant associations. This article has supplementary material online.

SUBMITTER: Zhang Y

PROVIDER: S-EPMC3226809 | biostudies-literature | 2011 Sep

REPOSITORIES: biostudies-literature

Publications

Fast and Accurate Approximation to Significance Tests in Genome-Wide Association Studies.

Zhang Yu, Liu Jun S

Journal of the American Statistical Association, 2011-09-01, 495

PMID: 22140288

Similar Datasets

Fast and general tests of genetic interaction for genome-wide association studies.
| S-EPMC5478145 | biostudies-literature

Fast pairwise IBD association testing in genome-wide association studies.
| S-EPMC3892684 | biostudies-literature

Fast and Accurate Genome-Wide Association Test of Multiple Quantitative Traits.
| S-EPMC5878919 | biostudies-literature

Robust Association Tests for the Replication of Genome-Wide Association Studies.
| S-EPMC4539975 | biostudies-literature

Resampling-based tests for Lasso in genome-wide association studies.
| S-EPMC5525347 | biostudies-literature

FAPI: Fast and accurate P-value Imputation for genome-wide association study.
| S-EPMC4930094 | biostudies-literature

Multiple phenotype association tests using summary statistics in genome-wide association studies.
| S-EPMC5743780 | biostudies-literature

Estimation of a significance threshold for genome-wide association studies.
| S-EPMC6664749 | biostudies-literature

BLUPmrMLM: A Fast mrMLM Algorithm in Genome-wide Association Studies.
| S-EPMC12016565 | biostudies-literature

LFMM 2: Fast and Accurate Inference of Gene-Environment Associations in Genome-Wide Studies.
| S-EPMC6659841 | biostudies-literature