ABSTRACT:
Summary
Summary-level data from GWAS are becoming increasingly important in post-GWAS data mining. Here, we present GIGSEA (Genotype Imputed Gene Set Enrichment Analysis), a novel method that uses GWAS summary statistics and eQTL to infer differential gene expression and to interrogate gene set enrichment for the trait-associated SNPs. By incorporating empirical eQTL of the disease-relevant tissue, GIGSEA naturally accounts for factors such as gene size, gene boundary, SNP distal regulation and multiple-marker regulation. A weighted linear regression model is used to perform the enrichment test, adjusting for imputation accuracy, model incompleteness and redundancy among gene sets. The significance of enrichment is assessed by a permutation test, in which matrix operations are used to substantially improve computation speed. GIGSEA has appropriate type I error rates and yields biologically plausible findings on a real data set.
Availability and implementation
GIGSEA is implemented in R and is freely available at www.github.com/zhushijia/GIGSEA.
Supplementary information
Supplementary data are available at Bioinformatics online.
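The enrichment test described in the summary (a weighted linear regression whose significance is assessed by permutation) can be sketched roughly as follows. This is a minimal, hypothetical illustration in Python, not the GIGSEA R package or its API. The function names, the toy gene scores and the weights are all assumptions made for the example. It regresses per-gene differential expression scores on a binary gene set membership indicator, weights each gene (here standing in for imputation accuracy), and permutes the membership labels to obtain an empirical p-value. It uses a plain loop for clarity rather than the matrix formulation the summary mentions.

```python
import random


def weighted_slope(x, y, w):
    """Slope b of the weighted least-squares fit y ~ a + b*x with weights w."""
    sw = sum(w)
    mx = sum(wi * xi for wi, xi in zip(w, x)) / sw
    my = sum(wi * yi for wi, yi in zip(w, y)) / sw
    sxy = sum(wi * (xi - mx) * (yi - my) for wi, xi, yi in zip(w, x, y))
    sxx = sum(wi * (xi - mx) ** 2 for wi, xi in zip(w, x))
    return sxy / sxx


def permutation_enrichment(membership, scores, weights, n_perm=2000, seed=0):
    """Observed weighted slope and a one-sided permutation p-value.

    Membership labels are shuffled across genes; scores and weights stay
    paired with their genes. The +1 terms keep the p-value above zero.
    """
    rng = random.Random(seed)
    observed = weighted_slope(membership, scores, weights)
    labels = list(membership)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(labels)
        if weighted_slope(labels, scores, weights) >= observed:
            hits += 1
    return observed, (hits + 1) / (n_perm + 1)


# Toy data (assumed for illustration): 20 genes, the first 5 in the gene set.
membership = [1] * 5 + [0] * 15
scores = [2.5, 3.1, 2.8, 1.9, 2.2,
          0.1, -0.4, 0.3, -0.2, 0.5, -0.6, 0.0, 0.2,
          -0.1, 0.4, -0.3, 0.1, -0.5, 0.6, -0.2]
# Hypothetical per-gene weights, e.g. a proxy for imputation accuracy.
weights = [0.9, 0.8, 0.7, 0.6, 0.9,
           0.5, 0.6, 0.7, 0.8, 0.9, 0.5, 0.6, 0.7,
           0.8, 0.9, 0.5, 0.6, 0.7, 0.8, 0.9]

slope, p_value = permutation_enrichment(membership, scores, weights)
print(f"weighted slope = {slope:.3f}, permutation p = {p_value:.4f}")
```

With a binary predictor, the weighted slope equals the difference between the weighted mean score inside and outside the gene set, so a large positive slope with a small permutation p-value suggests enrichment in this toy setting.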
SUBMITTER: Zhu S
PROVIDER: S-EPMC6298047 | biostudies-literature | 2019 Jan
REPOSITORIES: biostudies-literature
Zhu Shijia, Qian Tongqi, Hoshida Yujin, Shen Yuan, Yu Jing, Hao Ke
Bioinformatics (Oxford, England), 2019-01-01, issue 1