Unknown

Dataset Information

0

Constraints on eQTL Fine Mapping in the Presence of Multisite Local Regulation of Gene Expression.


ABSTRACT: Expression quantitative trait locus (eQTL) detection has emerged as an important tool for unraveling of the relationship between genetic risk factors and disease or clinical phenotypes. Most studies use single marker linear regression to discover primary signals, followed by sequential conditional modeling to detect secondary genetic variants affecting gene expression. However, this approach assumes that functional variants are sparsely distributed and that close linkage between them has little impact on estimation of their precise location and the magnitude of effects. We describe a series of simulation studies designed to evaluate the impact of linkage disequilibrium (LD) on the fine mapping of causal variants with typical eQTL effect sizes. In the presence of multisite regulation, even though between 80 and 90% of modeled eSNPs associate with normally distributed traits, up to 10% of all secondary signals could be statistical artifacts, and at least 5% but up to one-quarter of credible intervals of SNPs within r2 > 0.8 of the peak may not even include a causal site. The Bayesian methods eCAVIAR and DAP (Deterministic Approximation of Posteriors) provide only modest improvement in resolution. Given the strong empirical evidence that gene expression is commonly regulated by more than one variant, we conclude that the fine mapping of causal variants needs to be adjusted for multisite influences, as conditional estimates can be highly biased by interference among linked sites, but ultimately experimental verification of individual effects is needed. Presumably similar conclusions apply not just to eQTL mapping, but to multisite influences on fine mapping of most types of quantitative trait.

SUBMITTER: Zeng B 

PROVIDER: S-EPMC5555460 | biostudies-literature | 2017 Aug

REPOSITORIES: biostudies-literature

altmetric image

Publications


Expression quantitative trait locus (eQTL) detection has emerged as an important tool for unraveling of the relationship between genetic risk factors and disease or clinical phenotypes. Most studies use single marker linear regression to discover primary signals, followed by sequential conditional modeling to detect secondary genetic variants affecting gene expression. However, this approach assumes that functional variants are sparsely distributed and that close linkage between them has little  ...[more]

Similar Datasets

| S-EPMC7334875 | biostudies-literature
| S-EPMC10616627 | biostudies-literature
| S-EPMC8150140 | biostudies-literature
| S-EPMC6952111 | biostudies-literature
2019-11-28 | GSE140553 | GEO
| S-EPMC3118554 | biostudies-literature
| S-EPMC4376710 | biostudies-literature
| S-EPMC7118581 | biostudies-literature
| S-EPMC3014048 | biostudies-literature
| PRJNA590167 | ENA