Unknown

Dataset Information

0

A data-adaptive Bayesian regression approach for polygenic risk prediction.


ABSTRACT:

Motivation

Polygenic risk score (PRS) has been widely exploited for genetic risk prediction due to its accuracy and conceptual simplicity. We introduce a unified Bayesian regression framework, NeuPred, for PRS construction, which accommodates varying genetic architectures and improves overall prediction accuracy for complex diseases by allowing for a wide class of prior choices. To take full advantage of the framework, we propose a summary-statistics-based cross-validation strategy to automatically select suitable chromosome-level priors, which demonstrates a striking variability of the prior preference of each chromosome, for the same complex disease, and further significantly improves the prediction accuracy.

Results

Simulation studies and real data applications with seven disease datasets from the Wellcome Trust Case Control Consortium cohort and eight groups of large-scale genome-wide association studies demonstrate that NeuPred achieves substantial and consistent improvements in terms of predictive r2 over existing methods. In addition, NeuPred has similar or advantageous computational efficiency compared with the state-of-the-art Bayesian methods.

Availability and implementation

The R package implementing NeuPred is available at https://github.com/shuangsong0110/NeuPred.

Supplementary information

Supplementary data are available at Bioinformatics online.

SUBMITTER: Song S 

PROVIDER: S-EPMC8963326 | biostudies-literature | 2022 Mar

REPOSITORIES: biostudies-literature

altmetric image

Publications

A data-adaptive Bayesian regression approach for polygenic risk prediction.

Song Shuang S   Hou Lin L   Liu Jun S JS  

Bioinformatics (Oxford, England) 20220301 7


<h4>Motivation</h4>Polygenic risk score (PRS) has been widely exploited for genetic risk prediction due to its accuracy and conceptual simplicity. We introduce a unified Bayesian regression framework, NeuPred, for PRS construction, which accommodates varying genetic architectures and improves overall prediction accuracy for complex diseases by allowing for a wide class of prior choices. To take full advantage of the framework, we propose a summary-statistics-based cross-validation strategy to au  ...[more]

Similar Datasets

| S-EPMC6467998 | biostudies-literature
| S-EPMC10705229 | biostudies-literature
| S-EPMC6841727 | biostudies-literature
| S-EPMC10055041 | biostudies-literature
| S-EPMC11271575 | biostudies-literature
| S-EPMC11834989 | biostudies-literature
| S-EPMC10120638 | biostudies-literature
| S-EPMC10656345 | biostudies-literature
| S-EPMC11684611 | biostudies-literature
| S-EPMC5638690 | biostudies-literature