
Dataset Information


Accurate and fast multiple-testing correction in eQTL studies.


ABSTRACT: In studies of expression quantitative trait loci (eQTLs), it is of increasing interest to identify eGenes, the genes whose expression levels are associated with variation at a particular genetic variant. Detecting eGenes is important for follow-up analyses and prioritization because genes are the main entities in biological processes. To detect eGenes, one typically focuses on the genetic variant with the minimum p value among all variants in cis with a gene and corrects for multiple testing to obtain a gene-level p value. A permutation test is widely used to perform this multiple-testing correction. Because of the growing sample sizes of eQTL studies, however, the permutation test has become a computational bottleneck. In this paper, we propose an efficient approach for correcting for multiple testing and assessing eGene p values by utilizing a multivariate normal distribution. Our approach properly takes into account the linkage-disequilibrium structure among variants, and its time complexity is independent of sample size. By applying our small-sample correction techniques, our method achieves high accuracy in both small and large studies. We have shown that our method consistently produces extremely accurate p values (accuracy > 98%) for three human eQTL datasets with different sample sizes and SNP densities: the Genotype-Tissue Expression pilot dataset, the multi-region brain dataset, and the HapMap 3 dataset.
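As a rough illustration of the two strategies the abstract contrasts, the Python sketch below computes a gene-level p value first by permuting expression values and then by sampling from a multivariate normal distribution whose covariance is the variant correlation (LD) matrix. It is not the authors' implementation: the function names, the choice of correlation as the per-variant statistic, and the large-sample approximation z = r * sqrt(n) are assumptions made here for illustration, and the small-sample correction mentioned in the abstract is omitted.

```python
import numpy as np

def max_abs_corr(genotypes, expression):
    """Largest |correlation| between expression and any cis variant.

    The variant with the largest |r| is also the one with the minimum
    p value, so this stands in for the min-p statistic.
    Assumes every variant column is polymorphic (non-zero variance).
    """
    g = (genotypes - genotypes.mean(axis=0)) / genotypes.std(axis=0)
    e = (expression - expression.mean()) / expression.std()
    r = g.T @ e / len(e)
    return np.max(np.abs(r))

def permutation_gene_p(genotypes, expression, n_perm=1000, seed=None):
    """Gene-level p value by permuting expression across samples.

    Each permutation recomputes every variant statistic, so the cost
    grows with the number of samples.
    """
    rng = np.random.default_rng(seed)
    observed = max_abs_corr(genotypes, expression)
    hits = sum(
        max_abs_corr(genotypes, rng.permutation(expression)) >= observed
        for _ in range(n_perm)
    )
    return (hits + 1) / (n_perm + 1)

def mvn_gene_p(genotypes, expression, n_draws=10000, seed=None):
    """Gene-level p value from multivariate normal draws (illustrative).

    Null z scores are drawn with the LD (correlation) matrix as their
    covariance. Assumption: z = r * sqrt(n), a large-sample
    approximation with no small-sample correction.
    """
    rng = np.random.default_rng(seed)
    n = len(expression)
    observed_z = max_abs_corr(genotypes, expression) * np.sqrt(n)
    ld = np.corrcoef(genotypes, rowvar=False)
    draws = rng.multivariate_normal(np.zeros(ld.shape[0]), ld, size=n_draws)
    null_max = np.abs(draws).max(axis=1)
    return (np.sum(null_max >= observed_z) + 1) / (n_draws + 1)
```

Once the LD matrix has been computed, the cost of the sampling step in `mvn_gene_p` depends on the number of variants and the number of draws rather than on the number of samples, which is the property the abstract highlights.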

SUBMITTER: Sul JH 

PROVIDER: S-EPMC4457958 | biostudies-literature | 2015 Jun

REPOSITORIES: biostudies-literature


Publications

Accurate and fast multiple-testing correction in eQTL studies.

Jae Hoon Sul, Towfique Raj, Simone de Jong, Paul I W de Bakker, Soumya Raychaudhuri, Roel A Ophoff, Barbara E Stranger, Eleazar Eskin, Buhm Han

American Journal of Human Genetics, 28 May 2015, issue 6



Similar Datasets

| S-EPMC6970436 | biostudies-literature
| S-EPMC4469571 | biostudies-literature
| S-EPMC7881646 | biostudies-literature
| S-EPMC7132638 | biostudies-literature
| S-EPMC4716687 | biostudies-literature
| S-EPMC2663787 | biostudies-literature
| S-EPMC6941997 | biostudies-literature
| S-EPMC6280799 | biostudies-other
| S-EPMC7005598 | biostudies-literature
| S-EPMC4818520 | biostudies-literature