Dataset Information

Association analyses of the MAS-QTL data set using grammar, principal components and Bayesian network methodologies.


ABSTRACT:

Background

Genome-wide association studies that do not account for genetic relationships among individuals can produce false positives. To address this problem, we used Genome-wide Rapid Association using Mixed Model and Regression (GRAMMAR) and principal component stratification analyses. To account for linkage disequilibrium among the significant markers, principal component loadings obtained from the top markers can be included as covariates. Estimation of Bayesian networks may also be useful for investigating linkage disequilibrium among SNPs and their relationship with environmental variables.

For the quantitative trait, we first estimated residuals while taking polygenic effects into account. We then used a single-SNP approach to detect the most significant SNPs based on the residuals, and applied principal component regression to take linkage disequilibrium among these SNPs into account. For the categorical trait, we used principal component stratification to account for background effects and principal component logit regression to correct for linkage disequilibrium. Bayesian networks were estimated to investigate relationships among SNPs.
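
The two-step idea described for the quantitative trait (residuals first, then a single-SNP scan on those residuals) can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: subtracting family means is a crude stand-in for a model with polygenic effects, squared correlation stands in for a formal test statistic, and every function name and the simulated data are hypothetical.

```python
import random
from statistics import mean


def family_residuals(phenotypes, families):
    """Step 1 (simplified stand-in): remove each family's mean phenotype.

    Assumption: a real analysis would fit a mixed model with polygenic
    effects; family-mean centring is used here only for illustration.
    """
    by_family = {}
    for y, fam in zip(phenotypes, families):
        by_family.setdefault(fam, []).append(y)
    family_mean = {fam: mean(values) for fam, values in by_family.items()}
    return [y - family_mean[fam] for y, fam in zip(phenotypes, families)]


def squared_correlation(x, y):
    """Squared Pearson correlation; returns 0.0 if either input is constant."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0
    return sxy * sxy / (sxx * syy)


def single_snp_scan(genotypes, residuals):
    """Step 2: score each SNP on its own against the step-1 residuals.

    genotypes is a list with one entry per SNP; each entry holds the
    0/1/2 allele counts of all individuals.
    """
    return [squared_correlation(snp, residuals) for snp in genotypes]


def simulate(n_families=20, family_size=10, n_snps=30, causal=3, seed=1):
    """Hypothetical toy data: family effects plus one causal SNP."""
    rng = random.Random(seed)
    families, phenotypes = [], []
    genotypes = [[] for _ in range(n_snps)]
    for fam in range(n_families):
        family_effect = rng.gauss(0, 2.0)
        for _ in range(family_size):
            counts = [rng.choice([0, 1, 2]) for _ in range(n_snps)]
            for j, count in enumerate(counts):
                genotypes[j].append(count)
            families.append(fam)
            phenotypes.append(family_effect + 1.5 * counts[causal] + rng.gauss(0, 1))
    return genotypes, phenotypes, families


if __name__ == "__main__":
    genotypes, phenotypes, families = simulate()
    residuals = family_residuals(phenotypes, families)
    scores = single_snp_scan(genotypes, residuals)
    best = max(range(len(scores)), key=lambda j: scores[j])
    print("top-ranked SNP index:", best)
```

Working on residuals means the per-SNP scan is a cheap regression that does not refit the background model for every marker, which is the practical appeal of a two-step approach.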

Results

Using the Genome-wide Rapid Association using Mixed Model and Regression and principal component stratification approaches, we detected around 100 significant SNPs for the quantitative trait (p < 0.05 with 1000 permutations) and 109 significant SNPs for the categorical trait (p < 0.0006 with local FDR correction). With additional principal component regression, we reduced these lists to 16 and 50 SNPs for the quantitative and categorical traits, respectively.
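
A permutation-based significance threshold of the kind quoted for the quantitative trait can be sketched as below. The abstract does not say how the 1000 permutations were used, so this sketch assumes the common max-statistic scheme: the trait is shuffled, the largest statistic across all SNPs is recorded for each shuffle, and the (1 - alpha) quantile of those maxima serves as the threshold. The function names and the squared-correlation statistic are hypothetical choices for illustration.

```python
import random
from statistics import mean


def squared_correlation(x, y):
    """Squared Pearson correlation; returns 0.0 if either input is constant."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0
    return sxy * sxy / (sxx * syy)


def permutation_threshold(genotypes, trait, n_perm=1000, alpha=0.05, seed=0):
    """Experiment-wide threshold from the permutation null of the maximum.

    Assumption: max-statistic permutation scheme; the source only states
    "p < 0.05 with 1000 permutations".
    """
    rng = random.Random(seed)
    shuffled = list(trait)  # copy so the caller's data is left untouched
    max_stats = []
    for _ in range(n_perm):
        rng.shuffle(shuffled)  # breaks any genotype-trait association
        max_stats.append(
            max(squared_correlation(snp, shuffled) for snp in genotypes)
        )
    max_stats.sort()
    index = min(n_perm - 1, round((1 - alpha) * n_perm))
    return max_stats[index]
```

A SNP would then be declared significant when its observed statistic exceeds the returned threshold. Shuffling the trait as a whole keeps the correlation structure among SNPs intact, so the threshold reflects the effective number of tests without assuming independence among markers.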

Conclusions

GRAMMAR could efficiently incorporate the information regarding random genetic effects. Principal component stratification should be used cautiously, with stringent multiple hypothesis testing correction, to correct for ancestral stratification in association analyses of binary traits when there are systematic genetic effects such as half-sib family structures. Bayesian networks are useful for investigating relationships among SNPs and environmental variables.

SUBMITTER: Karacaoren B 

PROVIDER: S-EPMC3103207 | biostudies-literature | 2011 May

REPOSITORIES: biostudies-literature

Publications

Association analyses of the MAS-QTL data set using grammar, principal components and Bayesian network methodologies.

Karacaören B, Silander T, Alvarez-Castro JM, Haley CS, de Koning DJ

BMC Proceedings, 2011-05-27



Similar Datasets

| S-EPMC2367470 | biostudies-literature
| S-EPMC3328249 | biostudies-literature
2013-03-14 | E-GEOD-26520 | biostudies-arrayexpress
2013-03-14 | GSE26520 | GEO
| S-EPMC6039029 | biostudies-literature
| S-EPMC4956302 | biostudies-other
| S-EPMC3392282 | biostudies-literature
| S-EPMC7083277 | biostudies-literature
| S-EPMC7147758 | biostudies-literature
| S-EPMC3626710 | biostudies-literature