Dataset Information

A flexible and parallelizable approach to genome-wide polygenic risk scores.

ABSTRACT: The heritability of most complex traits is driven by variants throughout the genome. Consequently, polygenic risk scores, which combine information on multiple variants genome-wide, have demonstrated improved accuracy in genetic risk prediction. We present a new two-step approach to constructing genome-wide polygenic risk scores from meta-GWAS summary statistics. Local linkage disequilibrium (LD) is adjusted for in Step 1, followed by, uniquely, long-range LD in Step 2. Our algorithm is highly parallelizable since block-wise analyses in Step 1 can be distributed across a high-performance computing cluster, and flexible, since sparsity and heritability are estimated within each block. Inference is obtained through a formal Bayesian variable selection framework, meaning final risk predictions are averaged over competing models. We compared our method to two alternative approaches: LDPred and lassosum using all seven traits in the Welcome Trust Case Control Consortium as well as meta-GWAS summaries for type 1 diabetes (T1D), coronary artery disease, and schizophrenia. Performance was generally similar across methods, although our framework provided more accurate predictions for T1D, for which there are multiple heterogeneous signals in regions of both short- and long-range LD. With sufficient compute resources, our method also allows the fastest runtimes.

SUBMITTER: Newcombe PJ

PROVIDER: S-EPMC6764842 | biostudies-literature | 2019 Oct

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

A flexible and parallelizable approach to genome-wide polygenic risk scores.

Newcombe Paul J PJ Nelson Christopher P CP Samani Nilesh J NJ Dudbridge Frank F

Genetic epidemiology 20190722 7

The heritability of most complex traits is driven by variants throughout the genome. Consequently, polygenic risk scores, which combine information on multiple variants genome-wide, have demonstrated improved accuracy in genetic risk prediction. We present a new two-step approach to constructing genome-wide polygenic risk scores from meta-GWAS summary statistics. Local linkage disequilibrium (LD) is adjusted for in Step 1, followed by, uniquely, long-range LD in Step 2. Our algorithm is highly p ...[more]

PMID: 31328830

Similar Datasets

Project description:Aggression and callous, uncaring, and unemotional (CU) traits are clinically related behavioral constructs caused by genetic and environmental factors. We performed polygenic risk score (PRS) analyses to investigate shared genetic etiology between aggression and these three CU-traits. Furthermore, we studied interactions of PRS with smoking during pregnancy and childhood life events in relation to CU-traits. Summary statistics for the base phenotype were derived from the EAGLE-consortium genome-wide association study of children's aggressive behavior and were used to calculate individual-level genome-wide and gene-set PRS in the NeuroIMAGE target-sample. Target phenotypes were 'callousness', 'uncaring', and 'unemotional' sumscores of the Inventory of Callous-Unemotional traits. A total of 779 subjects and 1,192,414 single-nucleotide polymorphisms were available for PRS-analyses. Gene-sets comprised serotonergic, dopaminergic, glutamatergic, and neuroendocrine signaling pathways. Genome-wide PRS showed evidence of association with uncaring scores (explaining up to 1.59% of variance; self-contained Q = 0.0306, competitive-P = 0.0015). Dopaminergic, glutamatergic, and neuroendocrine PRS showed evidence of association with unemotional scores (explaining up to 1.33, 2.00, and 1.20% of variance respectively; self-contained Q-values 0.037, 0.0115, and 0.0473 respectively, competitive-P-values 0.0029, 0.0002, and 0.0045 respectively). Smoking during pregnancy related to callousness scores while childhood life events related to both callousness and unemotionality. Moreover, dopaminergic PRS appeared to interact with childhood life events in relation to unemotional scores. Our study provides evidence suggesting shared genetic etiology between aggressive behavior and uncaring, and unemotional CU-traits in children. Gene-set PRS confirmed involvement of shared glutamatergic, dopaminergic, and neuroendocrine genetic variation in aggression and CU-traits. Replication of current findings is needed.

Dataset Information

A flexible and parallelizable approach to genome-wide polygenic risk scores.

Publications

A flexible and parallelizable approach to genome-wide polygenic risk scores.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets