Dataset Information

Interpretation of association signals and identification of causal variants from genome-wide association studies.

ABSTRACT: GWAS have been successful in identifying disease susceptibility loci, but it remains a challenge to pinpoint the causal variants in subsequent fine-mapping studies. A conventional fine-mapping effort starts by sequencing dozens of randomly selected samples at susceptibility loci to discover candidate variants, which are then placed on custom arrays or used in imputation algorithms to find the causal variants. We propose that one or several rare or low-frequency causal variants can hitchhike the same common tag SNP, so causal variants may not be easily unveiled by conventional efforts. Here, we first demonstrate that the true effect size and proportion of variance explained by a collection of rare causal variants can be underestimated by a common tag SNP, thereby accounting for some of the "missing heritability" in GWAS. We then describe a case-selection approach based on phasing long-range haplotypes and sequencing cases predicted to harbor causal variants. We compare this approach with conventional strategies on a simulated data set, and we demonstrate its advantages when multiple causal variants are present. We also evaluate this approach in a GWAS on hearing loss, where the most common causal variant has a minor allele frequency (MAF) of 1.3% in the general population and 8.2% in 329 cases. With our case-selection approach, it is present in 88% of the 32 selected cases (MAF = 66%), so sequencing a subset of these cases can readily reveal the causal allele. Our results suggest that thinking beyond common variants is essential in interpreting GWAS signals and identifying causal variants.

SUBMITTER: Wang K

PROVIDER: S-EPMC2869011 | biostudies-literature | 2010 May

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

Interpretation of association signals and identification of causal variants from genome-wide association studies.

Wang Kai K Dickson Samuel P SP Stolle Catherine A CA Krantz Ian D ID Goldstein David B DB Hakonarson Hakon H

American journal of human genetics 20100429 5

GWAS have been successful in identifying disease susceptibility loci, but it remains a challenge to pinpoint the causal variants in subsequent fine-mapping studies. A conventional fine-mapping effort starts by sequencing dozens of randomly selected samples at susceptibility loci to discover candidate variants, which are then placed on custom arrays or used in imputation algorithms to find the causal variants. We propose that one or several rare or low-frequency causal variants can hitchhike the ...[more]

PMID: 20434130

Dataset Information

Interpretation of association signals and identification of causal variants from genome-wide association studies.

Publications

Interpretation of association signals and identification of causal variants from genome-wide association studies.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

GhostKnockoff inference empowers identification of putative causal variants in genome-wide association studies.
| S-EPMC9684164 | biostudies-literature

Network-assisted investigation of combined causal signals from genome-wide association studies in schizophrenia.
| S-EPMC3390381 | biostudies-literature

KnockoffTrio: A knockoff framework for the identification of putative causal variants in genome-wide association studies with trio design.
| S-EPMC9606389 | biostudies-literature

Detecting signals in pharmacogenomic genome-wide association studies.
| S-EPMC4085158 | biostudies-literature

Genome-wide fine-mapping improves identification of causal variants.
| S-EPMC11275676 | biostudies-literature

Identification of disease-associate variants of aggressive periodontitis using genome-wide association studies.
| S-EPMC10582758 | biostudies-literature

Biological interpretation of genome-wide association studies using predicted gene functions.
| S-EPMC4420238 | biostudies-literature

CAUSALdb: a database for disease/trait causal variants identified using summary statistics of genome-wide association studies.
| S-EPMC7145620 | biostudies-literature

On the use of general control samples for genome-wide association studies: genetic matching highlights causal variants.
| S-EPMC2427172 | biostudies-literature

Identification of Pleiotropic Cancer Susceptibility Variants from Genome-Wide Association Studies Reveals Functional Characteristics.
| S-EPMC5760292 | biostudies-literature