Dataset Information

Resource profile and user guide of the Polygenic Index Repository.


ABSTRACT: Polygenic indexes (PGIs) are DNA-based predictors. Their value for research in many scientific disciplines is growing rapidly. As a resource for researchers, we used a consistent methodology to construct PGIs for 47 phenotypes in 11 datasets. To maximize the PGIs' prediction accuracies, we constructed them using genome-wide association studies (some not previously published) from multiple data sources, including 23andMe and UK Biobank. We present a theoretical framework to help interpret analyses involving PGIs. A key insight is that a PGI can be understood as an unbiased but noisy measure of a latent variable we call the 'additive SNP factor'. Regressions in which the true regressor is this factor but the PGI is used as its proxy therefore suffer from errors-in-variables bias. We derive an estimator that corrects for the bias, illustrate the correction, and make a Python tool for implementing it publicly available.
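
The errors-in-variables bias described in the abstract can be illustrated with a small simulation. The sketch below is not the estimator or the Python tool from the paper. It is the textbook classical measurement-error setup, in which a proxy equals the latent regressor plus independent noise, and the naive slope is rescaled by a reliability ratio that is assumed to be known. The function name `simulate_attenuation` and all parameter values are hypothetical and chosen only for illustration.

```python
import numpy as np


def simulate_attenuation(n=200_000, beta=0.5, reliability=0.4, seed=0):
    """Illustrate attenuation bias when a noisy proxy replaces a latent regressor.

    Assumptions (illustrative, not taken from the paper):
    - the latent factor has variance 1;
    - the proxy ("pgi") equals the factor plus independent noise;
    - reliability = var(factor) / var(pgi) is known exactly.
    """
    rng = np.random.default_rng(seed)

    # Latent regressor, standing in for the 'additive SNP factor'.
    factor = rng.standard_normal(n)

    # Noise variance chosen so that var(factor) / var(pgi) equals `reliability`.
    noise_var = (1.0 - reliability) / reliability
    pgi = factor + np.sqrt(noise_var) * rng.standard_normal(n)

    # The outcome depends on the latent factor, not on the proxy.
    y = beta * factor + rng.standard_normal(n)

    # Naive OLS slope of y on the proxy: shrunk toward zero by `reliability`.
    cov = np.cov(pgi, y)
    naive_slope = cov[0, 1] / cov[0, 0]

    # Classical disattenuation: divide the naive slope by the reliability.
    corrected_slope = naive_slope / reliability
    return naive_slope, corrected_slope


if __name__ == "__main__":
    naive, corrected = simulate_attenuation()
    print(f"naive slope:     {naive:.3f}")
    print(f"corrected slope: {corrected:.3f}")
```

Under these assumptions the naive slope converges to `beta * reliability` rather than `beta`, and dividing by the reliability recovers `beta`. In practice the reliability is not known and must itself be estimated, which is part of what a dedicated correction has to handle.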

SUBMITTER: Becker J 

PROVIDER: S-EPMC8678380 | biostudies-literature

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC2784028 | biostudies-literature
| S-EPMC9438378 | biostudies-literature
| S-EPMC11293310 | biostudies-literature
| S-EPMC9568819 | biostudies-literature
| S-EPMC8725060 | biostudies-literature
2023-06-06 | E-MTAB-11770 | biostudies-arrayexpress
| S-EPMC4383955 | biostudies-literature
| S-EPMC9291099 | biostudies-literature
| S-EPMC7340791 | biostudies-literature