Unknown

Dataset Information

0

Leveraging fine-mapping and multipopulation training data to improve cross-population polygenic risk scores.


ABSTRACT: Polygenic risk scores suffer reduced accuracy in non-European populations, exacerbating health disparities. We propose PolyPred, a method that improves cross-population polygenic risk scores by combining two predictors: a new predictor that leverages functionally informed fine-mapping to estimate causal effects (instead of tagging effects), addressing linkage disequilibrium differences, and BOLT-LMM, a published predictor. When a large training sample is available in the non-European target population, we propose PolyPred+, which further incorporates the non-European training data. We applied PolyPred to 49 diseases/traits in four UK Biobank populations using UK Biobank British training data, and observed relative improvements versus BOLT-LMM ranging from +7% in south Asians to +32% in Africans, consistent with simulations. We applied PolyPred+ to 23 diseases/traits in UK Biobank east Asians using both UK Biobank British and Biobank Japan training data, and observed improvements of +24% versus BOLT-LMM and +12% versus PolyPred. Summary statistics-based analogs of PolyPred and PolyPred+ attained similar improvements.

SUBMITTER: Weissbrod O 

PROVIDER: S-EPMC9009299 | biostudies-literature | 2022 Apr

REPOSITORIES: biostudies-literature

altmetric image

Publications

Leveraging fine-mapping and multipopulation training data to improve cross-population polygenic risk scores.

Weissbrod Omer O   Kanai Masahiro M   Shi Huwenbo H   Gazal Steven S   Peyrot Wouter J WJ   Khera Amit V AV   Okada Yukinori Y   Martin Alicia R AR   Finucane Hilary K HK   Price Alkes L AL  

Nature genetics 20220407 4


Polygenic risk scores suffer reduced accuracy in non-European populations, exacerbating health disparities. We propose PolyPred, a method that improves cross-population polygenic risk scores by combining two predictors: a new predictor that leverages functionally informed fine-mapping to estimate causal effects (instead of tagging effects), addressing linkage disequilibrium differences, and BOLT-LMM, a published predictor. When a large training sample is available in the non-European target popu  ...[more]

Similar Datasets

| S-EPMC10613261 | biostudies-literature
| S-EPMC10638399 | biostudies-literature
| S-EPMC8759285 | biostudies-literature
| S-EPMC9674947 | biostudies-literature
| S-EPMC7039528 | biostudies-literature
| S-EPMC6323418 | biostudies-literature
| S-EPMC8419981 | biostudies-literature
| S-EPMC8445431 | biostudies-literature
| S-EPMC8059341 | biostudies-literature
| S-EPMC4408026 | biostudies-literature