Dataset Information

A principal component approach to improve association testing with polygenic risk scores.

ABSTRACT: Polygenic risk scores (PRSs) have become an increasingly popular approach for demonstrating polygenic influences on complex traits and for establishing common polygenic signals between different traits. PRSs are typically constructed using pruning and thresholding (P+T), but the best choice of parameters is uncertain; thus multiple settings are used and the best is chosen. Optimization can lead to inflated Type I error. Permutation procedures can correct this, but they can be computationally intensive. Alternatively, a single parameter setting can be chosen a priori for the PRS, but choosing suboptimal settings results in loss of power. We propose computing PRSs under a range of parameter settings, performing principal component analysis (PCA) on the resulting set of PRSs, and using the first PRS-PC in association tests. The first PC reweights the variants included in the PRS to achieve maximum variation over all PRS settings used. Using simulations and a real data application to study PRS association with bipolar disorder and psychosis in bipolar disorder, we compare the performance of the proposed PRS-PCA approach with a permutation test and an a priori selected p-value threshold. The PRS-PCA approach is simple to implement, outperforms the other strategies in most scenarios, and provides an unbiased estimate of prediction performance.

SUBMITTER: Coombes BJ

PROVIDER: S-EPMC7722089 | biostudies-literature | 2020 Oct

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

A principal component approach to improve association testing with polygenic risk scores.

Coombes Brandon J BJ Ploner Alexander A Bergen Sarah E SE Biernacka Joanna M JM

Genetic epidemiology 20200721 7

Polygenic risk scores (PRSs) have become an increasingly popular approach for demonstrating polygenic influences on complex traits and for establishing common polygenic signals between different traits. PRSs are typically constructed using pruning and thresholding (P+T), but the best choice of parameters is uncertain; thus multiple settings are used and the best is chosen. Optimization can lead to inflated Type I error. Permutation procedures can correct this, but they can be computationally int ...[more]

PMID: 32691445

Dataset Information

A principal component approach to improve association testing with polygenic risk scores.

Publications

A principal component approach to improve association testing with polygenic risk scores.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

A Principal Component Informed Approach to Address Polygenic Risk Score Transferability Across European Cohorts.
| S-EPMC9340200 | biostudies-literature

Polygenic transcriptome risk scores (PTRS) can improve portability of polygenic risk scores across ancestries.
| S-EPMC8759285 | biostudies-literature

Multiethnic polygenic risk scores improve risk prediction in diverse populations.
| S-EPMC5726434 | biostudies-literature

Imputed gene expression risk scores: a functionally informed component of polygenic risk.
| S-EPMC8127405 | biostudies-literature

Independent component analysis of SNPs reflects polygenic risk scores for schizophrenia.
| S-EPMC5348276 | biostudies-literature

Adaptive elastic-net sparse principal component analysis for pathway association testing.
| S-EPMC3215429 | biostudies-literature

Association Between Polygenic Risk Scores and Outcome of ECT.
| S-EPMC10113810 | biostudies-literature

Leveraging effect size distributions to improve polygenic risk scores derived from summary statistics of genome-wide association studies.
| S-EPMC7039528 | biostudies-literature

FUNCTIONAL PRINCIPAL VARIANCE COMPONENT TESTING FOR A GENETIC ASSOCIATION STUDY OF HIV PROGRESSION.
| S-EPMC7111467 | biostudies-literature

A flexible and parallelizable approach to genome-wide polygenic risk scores.
| S-EPMC6764842 | biostudies-literature