
Dataset Information


A practical approach to adjusting for population stratification in genome-wide association studies: principal components and propensity scores (PCAPS).


ABSTRACT: Genome-wide association studies (GWAS) are susceptible to bias due to population stratification (PS). The most widely used method to correct for this bias is principal components analysis (PCA), but there is no objective method to guide which principal components (PCs) to include as covariates. Often, the ten PCs with the highest eigenvalues are included to adjust for PS. This selection is arbitrary, and patterns of local linkage disequilibrium may affect PCA corrections. To address these limitations, we estimate genomic propensity scores based on all statistically significant PCs selected by the Tracy-Widom (TW) statistic. We compare a principal components and propensity scores (PCAPS) approach to PCA and EMMAX using simulated GWAS data under no, moderate, and severe PS. PCAPS reduced spurious genetic associations regardless of the degree of PS, resulting in odds ratio (OR) estimates closer to the true OR. We illustrate our PCAPS method using GWAS data from a study of testicular germ cell tumors. PCAPS provided a more conservative adjustment than PCA. Advantages of the PCAPS approach include reduction of bias compared to PCA, consistent selection of propensity scores to adjust for PS, the potential ability to handle outliers, and ease of implementation using existing software packages.
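The workflow summarized in the abstract (compute PCs, keep a subset, collapse them into a propensity score, adjust the association test for that score) can be illustrated with a small NumPy sketch. Several parts are assumptions rather than the authors' implementation: the Tracy-Widom selection step is replaced by a fixed placeholder `k = 2`; the propensity score is assumed to be the fitted probability of carrying the tested variant given the selected PCs, which the abstract does not specify; and all function names, sample sizes, allele frequencies, and prevalences are hypothetical values chosen only for illustration.

```python
import numpy as np

def top_pcs(genotypes, k):
    """First k PC scores of a samples x SNPs matrix, scaled to unit variance."""
    g = genotypes - genotypes.mean(axis=0)
    sd = g.std(axis=0)
    g = g[:, sd > 0] / sd[sd > 0]          # drop monomorphic SNPs, standardize
    u, _, _ = np.linalg.svd(g, full_matrices=False)
    return u[:, :k] * np.sqrt(g.shape[0])

def fit_logistic(x, y, n_iter=25, ridge=1e-8):
    """Newton-Raphson logistic regression; an intercept column is prepended."""
    x = np.column_stack([np.ones(len(y)), x])
    beta = np.zeros(x.shape[1])
    for _ in range(n_iter):
        p = 1.0 / (1.0 + np.exp(-x @ beta))
        w = p * (1.0 - p)
        hess = x.T @ (x * w[:, None]) + ridge * np.eye(x.shape[1])
        beta += np.linalg.solve(hess, x.T @ (y - p))
    return beta

def predict_prob(x, beta):
    """Fitted probabilities for coefficients returned by fit_logistic."""
    x = np.column_stack([np.ones(x.shape[0]), x])
    return 1.0 / (1.0 + np.exp(-x @ beta))

def score_adjusted_log_or(snp, case, pcs):
    """Per-allele log OR for snp, adjusted for a PC-based propensity score.

    ASSUMPTION: the score is P(carrier of the variant | selected PCs).
    """
    carrier = (snp > 0).astype(float)
    score = predict_prob(pcs, fit_logistic(pcs, carrier))
    beta = fit_logistic(np.column_stack([snp, score]), case)
    return beta[1]

# Hypothetical simulation: two subpopulations, no true SNP effect.
rng = np.random.default_rng(0)
n, m = 1000, 500                            # samples per population, SNPs
pop = np.repeat([0, 1], n)
freq_a = rng.uniform(0.1, 0.9, m)
freq_b = np.clip(freq_a + rng.normal(0.0, 0.1, m), 0.05, 0.95)
geno = np.vstack([rng.binomial(2, freq_a, (n, m)),
                  rng.binomial(2, freq_b, (n, m))]).astype(float)

# Tested SNP differs in frequency between populations but has no causal effect.
snp = np.concatenate([rng.binomial(2, 0.2, n),
                      rng.binomial(2, 0.6, n)]).astype(float)
# Disease prevalence differs by population, which creates the confounding.
case = rng.binomial(1, np.where(pop == 0, 0.3, 0.7)).astype(float)

k = 2  # PLACEHOLDER for the number of PCs a Tracy-Widom test would retain
pcs = top_pcs(geno, k)

crude = fit_logistic(snp, case)[1]          # unadjusted per-allele log OR
adjusted = score_adjusted_log_or(snp, case, pcs)
print(f"crude log OR: {crude:.3f}, score-adjusted log OR: {adjusted:.3f}")
```

Because the true log OR in this simulation is zero, an adjusted estimate nearer zero than the crude one indicates that the score absorbed the stratification. A faithful reproduction would need an actual Tracy-Widom test for choosing `k` and the propensity-score model described in the full paper.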

SUBMITTER: Zhao H 

PROVIDER: S-EPMC6475581 | biostudies-literature | 2018 Dec

REPOSITORIES: biostudies-literature


Publications

A practical approach to adjusting for population stratification in genome-wide association studies: principal components and propensity scores (PCAPS).

Huaqing Zhao, Nandita Mitra, Peter A. Kanetsky, Katherine L. Nathanson, Timothy R. Rebbeck

Statistical Applications in Genetics and Molecular Biology, 2018-12-04, issue 6



Similar Datasets

| S-EPMC3864649 | biostudies-literature
| S-EPMC6456307 | biostudies-literature
| S-EPMC7612316 | biostudies-literature
| S-EPMC3480088 | biostudies-literature
| S-EPMC3117098 | biostudies-literature
| S-EPMC4224995 | biostudies-literature
| S-EPMC2941459 | biostudies-literature
| S-EPMC4706946 | biostudies-literature
| S-EPMC3365598 | biostudies-literature