Unknown

Dataset Information

0

Sparse simultaneous signal detection for identifying genetically controlled disease genes.


ABSTRACT: Genome-wide association studies (GWAS) and differential expression analyses have had limited success in finding genes that cause complex diseases such as heart failure (HF), a leading cause of death in the United States. This paper proposes a new statistical approach that integrates GWAS and expression quantitative trait loci (eQTL) data to identify important HF genes. For such genes, genetic variations that perturb its expression are also likely to influence disease risk. The proposed method thus tests for the presence of simultaneous signals: SNPs that are associated with the gene's expression as well as with disease. An analytic expression for the p-value is obtained, and the method is shown to be asymptotically adaptively optimal under certain conditions. It also allows the GWAS and eQTL data to be collected from different groups of subjects, enabling investigators to integrate public resources with their own data. Simulation experiments show that it can be more powerful than standard approaches and also robust to linkage disequilibrium between variants. The method is applied to an extensive analysis of HF genomics and identifies several genes with biological evidence for being functionally relevant in the etiology of HF. It is implemented in the R package ssa.

SUBMITTER: Zhao SD 

PROVIDER: S-EPMC5784841 | biostudies-literature | 2017

REPOSITORIES: biostudies-literature

altmetric image

Publications

Sparse simultaneous signal detection for identifying genetically controlled disease genes.

Zhao Sihai Dave SD   Cai T Tony TT   Cappola Thomas P TP   Margulies Kenneth B KB   Li Hongzhe H  

Journal of the American Statistical Association 20170105 519


Genome-wide association studies (GWAS) and differential expression analyses have had limited success in finding genes that cause complex diseases such as heart failure (HF), a leading cause of death in the United States. This paper proposes a new statistical approach that integrates GWAS and expression quantitative trait loci (eQTL) data to identify important HF genes. For such genes, genetic variations that perturb its expression are also likely to influence disease risk. The proposed method th  ...[more]

Similar Datasets

| S-EPMC6513717 | biostudies-literature
| S-EPMC6767648 | biostudies-literature
| S-EPMC4784713 | biostudies-literature
| S-EPMC4080512 | biostudies-literature
2010-08-13 | GSE12960 | GEO
| S-EPMC8375409 | biostudies-literature
| S-EPMC3283562 | biostudies-literature
| S-EPMC8152090 | biostudies-literature
2009-12-10 | GSE19267 | GEO
2009-12-09 | E-GEOD-19267 | biostudies-arrayexpress