Unknown

Dataset Information

0

AraGWAB: Network-based boosting of genome-wide association studies in Arabidopsis thaliana.


ABSTRACT: Genome-wide association studies (GWAS) have been applied for the genetic dissection of complex phenotypes in Arabidopsis thaliana. However, the significantly associated single-nucleotide polymorphisms (SNPs) could not explain all the phenotypic variations. A major reason for missing true phenotype-associated loci is the strict P-value threshold after adjustment for multiple hypothesis tests to reduce false positives. This statistical limitation can be partly overcome by increasing the sample size, but at a much higher cost. Alternatively, weak phenotype-association signals can be boosted by integrating other types of data. Here, we present a web application for network-based Arabidopsis genome-wide association boosting-araGWAB-which augments the likelihood of association with the given phenotype by integrating GWAS summary statistics (SNP P-values) and co-functional gene network information. The integration utilized the inherent values of SNPs with subthreshold significance, thus substantially increasing the information usage of GWAS data. We found that araGWAB could more effectively retrieve genes known to be associated with various phenotypes relevant to defense against bacterial pathogens, flowering time regulation, and organ development in A. thaliana. We also found that many of the network-boosted candidate genes for the phenotypes were supported by previous publications. The araGWAB is freely available at http://www.inetbio.org/aragwab/ .

SUBMITTER: Lee T 

PROVIDER: S-EPMC5811503 | biostudies-literature | 2018 Feb

REPOSITORIES: biostudies-literature

altmetric image

Publications

araGWAB: Network-based boosting of genome-wide association studies in Arabidopsis thaliana.

Lee Tak T   Lee Insuk I  

Scientific reports 20180213 1


Genome-wide association studies (GWAS) have been applied for the genetic dissection of complex phenotypes in Arabidopsis thaliana. However, the significantly associated single-nucleotide polymorphisms (SNPs) could not explain all the phenotypic variations. A major reason for missing true phenotype-associated loci is the strict P-value threshold after adjustment for multiple hypothesis tests to reduce false positives. This statistical limitation can be partly overcome by increasing the sample siz  ...[more]

Similar Datasets

| S-EPMC5530424 | biostudies-literature
| S-EPMC3129253 | biostudies-literature
| S-EPMC4232226 | biostudies-literature
| S-EPMC5793838 | biostudies-literature
| S-EPMC3168369 | biostudies-literature
| S-EPMC3023908 | biostudies-literature
| S-EPMC4180040 | biostudies-literature
| S-EPMC6556901 | biostudies-other
| S-EPMC2678928 | biostudies-literature
| S-EPMC4463847 | biostudies-other