Unknown

Dataset Information

0

Improving polygenic risk prediction from summary statistics by an empirical Bayes approach.


ABSTRACT: Polygenic risk scores (PRS) from genome-wide association studies (GWAS) are increasingly used to predict disease risks. However some included variants could be false positives and the raw estimates of effect sizes from them may be subject to selection bias. In addition, the standard PRS approach requires testing over a range of p-value thresholds, which are often chosen arbitrarily. The prediction error estimated from the optimized threshold may also be subject to an optimistic bias. To improve genomic risk prediction, we proposed new empirical Bayes approaches to recover the underlying effect sizes and used them as weights to construct PRS. We applied the new PRS to twelve cardio-metabolic traits in the Northern Finland Birth Cohort and demonstrated improvements in predictive power (in R2) when compared to standard PRS at the best p-value threshold. Importantly, for eleven out of the twelve traits studied, the predictive performance from the entire set of genome-wide markers outperformed the best R2 from standard PRS at optimal p-value thresholds. Our proposed methodology essentially enables an automatic PRS weighting scheme without the need of choosing tuning parameters. The new method also performed satisfactorily in simulations. It is computationally simple and does not require assumptions on the effect size distributions.

SUBMITTER: So HC 

PROVIDER: S-EPMC5286518 | biostudies-literature | 2017 Feb

REPOSITORIES: biostudies-literature

altmetric image

Publications

Improving polygenic risk prediction from summary statistics by an empirical Bayes approach.

So Hon-Cheong HC   Sham Pak C PC  

Scientific reports 20170201


Polygenic risk scores (PRS) from genome-wide association studies (GWAS) are increasingly used to predict disease risks. However some included variants could be false positives and the raw estimates of effect sizes from them may be subject to selection bias. In addition, the standard PRS approach requires testing over a range of p-value thresholds, which are often chosen arbitrarily. The prediction error estimated from the optimized threshold may also be subject to an optimistic bias. To improve  ...[more]

Similar Datasets

| S-EPMC7332650 | biostudies-literature
| S-EPMC6841727 | biostudies-literature
| S-EPMC8419981 | biostudies-literature
| S-EPMC8206385 | biostudies-literature
| S-EPMC4666841 | biostudies-literature
| S-EPMC8609771 | biostudies-literature
| S-EPMC7553329 | biostudies-literature
| S-EPMC6366007 | biostudies-literature