Unknown

Dataset Information

0

Genetic disease risks can be misestimated across global populations.


ABSTRACT:

Background

Accurate assessment of health disparities requires unbiased knowledge of genetic risks in different populations. Unfortunately, most genome-wide association studies use genotyping arrays and European samples. Here, we integrate whole genome sequence data from global populations, results from thousands of genome-wide association studies (GWAS), and extensive computer simulations to identify how genetic disease risks can be misestimated.

Results

In contrast to null expectations, we find that risk allele frequencies at known disease loci are significantly different for African populations compared to other continents. Strikingly, ancestral risk alleles are found at 9.51% higher frequency in Africa, and derived risk alleles are found at 5.40% lower frequency in Africa. By simulating GWAS with different study populations, we find that non-African cohorts yield disease associations that have biased allele frequencies and that African cohorts yield disease associations that are relatively free of bias. We also find empirical evidence that genotyping arrays and SNP ascertainment bias contribute to continental differences in risk allele frequencies. Because of these causes, polygenic risk scores can be grossly misestimated for individuals of African descent. Importantly, continental differences in risk allele frequencies are only moderately reduced if GWAS use whole genome sequences and hundreds of thousands of cases and controls. Finally, comparisons between uncorrected and corrected genetic risk scores reveal the benefits of considering whether risk alleles are ancestral or derived.

Conclusions

Our results imply that caution must be taken when extrapolating GWAS results from one population to predict disease risks in another population.

SUBMITTER: Kim MS 

PROVIDER: S-EPMC6234640 | biostudies-literature | 2018 Nov

REPOSITORIES: biostudies-literature

altmetric image

Publications

Genetic disease risks can be misestimated across global populations.

Kim Michelle S MS   Patel Kane P KP   Teng Andrew K AK   Berens Ali J AJ   Lachance Joseph J  

Genome biology 20181114 1


<h4>Background</h4>Accurate assessment of health disparities requires unbiased knowledge of genetic risks in different populations. Unfortunately, most genome-wide association studies use genotyping arrays and European samples. Here, we integrate whole genome sequence data from global populations, results from thousands of genome-wide association studies (GWAS), and extensive computer simulations to identify how genetic disease risks can be misestimated.<h4>Results</h4>In contrast to null expect  ...[more]

Similar Datasets

| S-EPMC8168748 | biostudies-literature
| S-EPMC6413820 | biostudies-literature
| S-EPMC10055601 | biostudies-literature
| S-EPMC8710798 | biostudies-literature
| S-EPMC9831004 | biostudies-literature
| S-EPMC8048360 | biostudies-literature
| S-EPMC3464021 | biostudies-other
| S-EPMC7011927 | biostudies-literature
| S-EPMC4795615 | biostudies-literature
| S-EPMC9191549 | biostudies-literature