Unknown

Dataset Information

0

Efficient cross-trait penalized regression increases prediction accuracy in large cohorts using secondary phenotypes.


ABSTRACT: We introduce cross-trait penalized regression (CTPR), a powerful and practical approach for multi-trait polygenic risk prediction in large cohorts. Specifically, we propose a novel cross-trait penalty function with the Lasso and the minimax concave penalty (MCP) to incorporate the shared genetic effects across multiple traits for large-sample GWAS data. Our approach extracts information from the secondary traits that is beneficial for predicting the primary trait based on individual-level genotypes and/or summary statistics. Our novel implementation of a parallel computing algorithm makes it feasible to apply our method to biobank-scale GWAS data. We illustrate our method using large-scale GWAS data (~1M SNPs) from the UK Biobank (N = 456,837). We show that our multi-trait method outperforms the recently proposed multi-trait analysis of GWAS (MTAG) for predictive performance. The prediction accuracy for height by the aid of BMI improves from R2 = 35.8% (MTAG) to 42.5% (MCP + CTPR) or 42.8% (Lasso + CTPR) with UK Biobank data.

SUBMITTER: Chung W 

PROVIDER: S-EPMC6361917 | biostudies-literature |

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC4342297 | biostudies-literature
| S-EPMC6499521 | biostudies-literature
| S-EPMC8636089 | biostudies-literature
| S-EPMC3285536 | biostudies-literature
| S-EPMC3350336 | biostudies-literature
| S-EPMC6191046 | biostudies-literature
| S-EPMC2585631 | biostudies-other
| S-EPMC6996807 | biostudies-literature
| S-EPMC5499546 | biostudies-literature
| S-EPMC6138053 | biostudies-literature