Unknown

Dataset Information

0

Performance Comparison of Two Gene Set Analysis Methods for Genome-wide Association Study Results: GSA-SNP vs i-GSEA4GWAS.


ABSTRACT: Gene set analysis (GSA) is useful in interpreting a genome-wide association study (GWAS) result in terms of biological mechanism. We compared the performance of two different GSA implementations that accept GWAS p-values of single nucleotide polymorphisms (SNPs) or gene-by-gene summaries thereof, GSA-SNP and i-GSEA4GWAS, under the same settings of inputs and parameters. GSA runs were made with two sets of p-values from a Korean type 2 diabetes mellitus GWAS study: 259,188 and 1,152,947 SNPs of the original and imputed genotype datasets, respectively. When Gene Ontology terms were used as gene sets, i-GSEA4GWAS produced 283 and 1,070 hits for the unimputed and imputed datasets, respectively. On the other hand, GSA-SNP reported 94 and 38 hits, respectively, for both datasets. Similar, but to a lesser degree, trends were observed with Kyoto Encyclopedia of Genes and Genomes (KEGG) gene sets as well. The huge number of hits by i-GSEA4GWAS for the imputed dataset was probably an artifact due to the scaling step in the algorithm. The decrease in hits by GSA-SNP for the imputed dataset may be due to the fact that it relies on Z-statistics, which is sensitive to variations in the background level of associations. Judicious evaluation of the GSA outcomes, perhaps based on multiple programs, is recommended.

SUBMITTER: Kwon JS 

PROVIDER: S-EPMC3480679 | biostudies-literature | 2012 Jun

REPOSITORIES: biostudies-literature

altmetric image

Publications

Performance Comparison of Two Gene Set Analysis Methods for Genome-wide Association Study Results: GSA-SNP vs i-GSEA4GWAS.

Kwon Ji-Sun JS   Kim Jihye J   Nam Dougu D   Kim Sangsoo S  

Genomics & informatics 20120630 2


Gene set analysis (GSA) is useful in interpreting a genome-wide association study (GWAS) result in terms of biological mechanism. We compared the performance of two different GSA implementations that accept GWAS p-values of single nucleotide polymorphisms (SNPs) or gene-by-gene summaries thereof, GSA-SNP and i-GSEA4GWAS, under the same settings of inputs and parameters. GSA runs were made with two sets of p-values from a Korean type 2 diabetes mellitus GWAS study: 259,188 and 1,152,947 SNPs of t  ...[more]

Similar Datasets

| S-EPMC2896081 | biostudies-literature
| S-EPMC4621033 | biostudies-literature
| S-EPMC3643925 | biostudies-literature
| S-EPMC3786949 | biostudies-literature
| S-EPMC2814805 | biostudies-other
| S-EPMC5984476 | biostudies-literature
| S-EPMC11329975 | biostudies-literature
| S-EPMC3032061 | biostudies-literature
| S-EPMC3079664 | biostudies-literature
| S-EPMC6537193 | biostudies-literature