Unknown

Dataset Information

0

European American stratification in ovarian cancer case control data: the utility of genome-wide data for inferring ancestry.


ABSTRACT: We investigated the ability of several principal components analysis (PCA)-based strategies to detect and control for population stratification using data from a multi-center study of epithelial ovarian cancer among women of European-American ethnicity. These include a correction based on an ancestry informative markers (AIMs) panel designed to capture European ancestral variation and corrections utilizing un-thinned genome-wide SNP data; case-control samples were drawn from four geographically distinct North-American sites. The AIMs-only and genome-wide first principal components (PC1) both corresponded to the previously described North or Northwest-Southeast axis of European variation. We found that the genome-wide PCA captured this primary dimension of variation more precisely and identified additional axes of genome-wide variation of relevance to epithelial ovarian cancer. Associations evident between the genome-wide PCs and study site corroborate North American immigration history and suggest that undiscovered dimensions of variation lie within Northern Europe. The structure captured by the genome-wide PCA was also found within control individuals and did not reflect the case-control variation present in the data. The genome-wide PCA highlighted three regions of local LD, corresponding to the lactase (LCT) gene on chromosome 2, the human leukocyte antigen system (HLA) on chromosome 6 and to a common inversion polymorphism on chromosome 8. These features did not compromise the efficacy of PCs from this analysis for ancestry control. This study concludes that although AIMs panels are a cost-effective way of capturing population structure, genome-wide data should preferably be used when available.

SUBMITTER: Raska P 

PROVIDER: S-EPMC3348917 | biostudies-literature | 2012

REPOSITORIES: biostudies-literature

altmetric image

Publications

European American stratification in ovarian cancer case control data: the utility of genome-wide data for inferring ancestry.

Raska Paola P   Iversen Edwin E   Chen Ann A   Chen Zhihua Z   Fridley Brooke L BL   Permuth-Wey Jennifer J   Tsai Ya-Yu YY   Vierkant Robert A RA   Goode Ellen L EL   Risch Harvey H   Schildkraut Joellen M JM   Sellers Thomas A TA   Barnholtz-Sloan Jill J  

PloS one 20120509 5


We investigated the ability of several principal components analysis (PCA)-based strategies to detect and control for population stratification using data from a multi-center study of epithelial ovarian cancer among women of European-American ethnicity. These include a correction based on an ancestry informative markers (AIMs) panel designed to capture European ancestral variation and corrections utilizing un-thinned genome-wide SNP data; case-control samples were drawn from four geographically  ...[more]

Similar Datasets

| S-EPMC7449501 | biostudies-literature
| S-EPMC1852743 | biostudies-literature
| S-EPMC5381037 | biostudies-literature
| S-EPMC5267476 | biostudies-literature
2021-01-01 | GSE156970 | GEO
| S-EPMC3046166 | biostudies-literature
2009-07-28 | GSE17356 | GEO
2021-01-01 | GSE156969 | GEO