Project description:By relating hundreds of thousands of genotypes to only a few phenotypes, mostly in individuals of one ancestry, genome-wide association (GWA) studies are identifying a rapidly growing number of associations. The Population Architecture using Genomics and Epidemiology (PAGE) Study is designed to further characterize the most promising variants along an epidemiological dimension that is substantial in its sample size, ethnic diversity, breadth of phenotypes, and exposures. PAGE includes scientists and population samples from large ongoing cohort studies: CALiCo (Causal Variants Across the Life Course, a consortium of ARIC, CARDIA, CHS, HCHS/SOL, Strong Heart Cohort Study, Strong Heart Family Study), EAGLE (Epidemiologic Architecture for Genes Linked to Environment, based on 3 National Health and Nutrition Examination Surveys (NHANES), MEC (Multiethnic Cohort) and WHI (Women's Health Initiative), with logistical and scientific support contributed by a Coordinating Center and the NHGRI Office of Population Genomics. These studies combined include over 270,000 participants, and populations represented include Asian Americans, African Americans, European Americans, Hispanic Americans, Native Hawaiians and American Indians. Health outcomes and traits of interest are prioritized, followed by replication of trait-genotype associations and generalization of their effects across population groups and environmental contexts. The first phase of genotyping focused on 901 high-profile SNPs having replicated associations with phenotypes related to diabetes, obesity, cardiovascular disease, lipids, cancers, menopause/menarche, inflammation and autoimmunity that are being genotyped in over 121,000 participants, as available for trait-specific analyses. SNP genotyping, quality control and population-specific association analyses are performed within each cohort, followed by meta-analyses using harmonized phenotypes and standardized analytic methods. The second phase of genotyping will be done using the Metabochip. The Metabochip is a custom large-scale (~200K SNPs) Illumina chip designed for association testing of several metabolic-related phenotypes. Genotyping will be performed at each study site and centralized analysis will be managed through the coordinating center. PAGE is generating 3 types of data: tier 1 (minimally curated phenotypes and analyses, all SNPs by all available phenotypes, used for Phenome-wide association study (PheWAS analysis)), tier 2 (carefully curated analyses of phenotypes available at only one of the four PAGE sites) and tier 3 (carefully curated phenotypes for multi-site analyses, data specifically generated for manuscripts). All tiers of data are submitted to the CC, where they undergo QC prior to submission to dbGaP.

Project description:PAGE II (2013-2017) seeks to expand understanding gained during PAGE I and similar studies of how ancestry-specific differences in allele frequencies and LD may explain differences in risks of common traits and conditions. Recent studies have identified rare genetic variants that are likely to contribute to common diseases and traits and observed that rare variants likely to be functional, such as those in coding and regulatory regions, tend to be population-specific. PAGE II has genotyped over 50,000 samples using MEGA, an Illumina high density custom exomechip array. The MEGA data is being imputed in PAGE to the 1000 Genomes panel. PAGE also sequenced 1,000 samples representative of 21 populations from the Americas. PAGE has harmonized phenotype data for ~300 trait variables. These datasets will be analyzed to continue emphasis on characterizing population-level disease risks in non-European-descent individuals. Cohorts in PAGE II are: CALiCo (Causal Variants Across the Life Course, a consortium of ARIC, CARDIA, HCHS/SOL, Strong Heart Studies), ISMMS (Mount Sinai BioMe Biobank), MEC (Multiethnic Cohort), WHI (Women's Health Initiative), and Stanford University (PAGE Global Reference Panel). Genotyping services were provided by the Center for Inherited Disease Research (CIDR) and sequence data were provided by the McDonnell Genome Institute at Washington University School of Medicine. PAGE I (2008-2013): The first phase of PAGE examined putative causal genetic variants across approximately 100,000 African Americans, Asian Americans, American Indians, European Americans, Hispanic Americans, and Native Hawaiians from four groups representing nine large U.S.-based cohorts. Two genotyping approaches were employed - targeted genotyping of selected SNPs identified in genome-wide association studies of common disease, and a large-scale effort focused on the Metabochip array, which facilitated trans-ethnic fine mapping of several diseases of public health importance. Cohorts in PAGE I were: CALiCo (Causal Variants Across the Life Course, a consortium of ARIC, CARDIA, CHS, HCHS/SOL, Strong Heart Cohort Study, Strong Heart Family Study), EAGLE (Epidemiologic Architecture for Genes Linked to Environment, based on 3 National Health and Nutrition Examination Surveys (NHANES)), MEC (Multiethnic Cohort) and WHI (Women's Health Initiative). Logistical and scientific support is provided by the PAGE Coordinating Center and the NHGRI Division of Genomic Medicine. PAGE II is funded by the NHGRI and the NIMHD. To access PAGE studies currently available in dbGaP, please click on the links below.Please note that some PAGE studies belong to larger cohorts and have been included as PAGE substudies. For those studies, there is an additional link to the parent study. <ul> <li><a href="study.cgi?study_id=phs000223">phs000223</a> PAGE-ARIC and <a href="study.cgi?study_id=phs000280">phs000280</a> ARIC Cohort</li> <li><a href="study.cgi?study_id=phs000236">phs000236</a> PAGE-CARDIA and <a href="study.cgi?study_id=phs000285">phs000285</a> CARDIA Cohort</li> <li><a href="study.cgi?study_id=phs000301">phs000301</a> PAGE-CHS and <a href="study.cgi?study_id=phs000287">phs000287</a> CHS Cohort</li> <li><a href="study.cgi?study_id=phs000559">phs000559</a> PAGE-EAGLE-BioVu</li> <li><a href="study.cgi?study_id=phs000208">phs000208</a> PAGE-EAGLE-NHANES</li> <li><a href="study.cgi?study_id=phs000555">phs000555</a> PAGE-HCHS/SOL and <a href="study.cgi?study_id=phs000810">phs000810</a> HCHS/SOL Cohort</li> <li><a href="study.cgi?study_id=phs000220">phs000220</a> PAGE-MEC</li> <li><a href="study.cgi?study_id=phs000580">phs000580</a> PAGE-SHS and SHFS</li> <li><a href="study.cgi?study_id=phs001033">phs001033</a> PAGE Global Reference Panel</li> <li><a href="study.cgi?study_id=phs000227">phs000227</a> PAGE-WHI and <a href="study.cgi?study_id=phs000200">phs000200</a> WHI Cohort</li> </ul>

Project description:The Multiethnic Cohort (MEC) has established a large biorepository of blood and urine (N=67,000) and cryopreserved lymphocytes (N=15,000) linked to extensive, prospectively collected risk factors (e.g., diet, smoking, physical activity), biomarkers and clinical data for five racial/ethnic groups. This cohort study of over 215,000 men and women in Hawaii and California is unique in that it is population-based and includes large representations of older adults (45-75 yrs at baseline) for five US racial/ethnic groups (Japanese Americans, African Americans, European Americans, Latinos and Native Hawaiians) at varying risks of chronic diseases. Within the PAGE investigation, the MEC proposes to study: 1) diseases for which we have DNA available for large numbers of cases and controls (breast, prostate, and colorectal cancer, diabetes, and obesity); 2) important cancers that are less common (e.g., lung, pancreas, endometrial cancers, NHL) but for which we propose to pool our data with other funded groups; 3) common traits that are risk factors for these diseases (e.g., body mass index/weight, waist-to-hip ratio, height) and 4) relevant disease-associated biomarkers (e.g., fasting insulin and lipids, steroid hormones). The specific aims are: 1) To determine the population-based epidemiologic profile (allele frequency, main effect, heterogeneity by disease characteristics) of putative causal variants in the five racial/ethnic groups in the MEC; 2) for variants displaying effect heterogeneity across ethnic/racial groups, we will utilize differences in LD to identify a more complete spectrum of associated variants at these loci; 3) investigate gene x gene and gene x environment interactions to identify modifiers; 4) examine the associations of putative causal variants with already measured intermediate phenotypes (e.g., plasma insulin, lipids, steroid hormones); and 5) for variants that do not fall within known genes, start to investigate their relationships with gene expression and epigenetic patterns in small genomic studies.

Project description:Using a phenome-wide association study (PheWAS) approach, we comprehensively tested genetic variants for association with phenotypes available for 70,061 study participants in the Population Architecture using Genomics and Epidemiology (PAGE) network. Our aim was to better characterize the genetic architecture of complex traits and identify novel pleiotropic relationships. This PheWAS drew on five population-based studies representing four major racial/ethnic groups (European Americans (EA), African Americans (AA), Hispanics/Mexican-Americans, and Asian/Pacific Islanders) in PAGE, each site with measurements for multiple traits, associated laboratory measures, and intermediate biomarkers. A total of 83 single nucleotide polymorphisms (SNPs) identified by genome-wide association studies (GWAS) were genotyped across two or more PAGE study sites. Comprehensive tests of association, stratified by race/ethnicity, were performed, encompassing 4,706 phenotypes mapped to 105 phenotype-classes, and association results were compared across study sites. A total of 111 PheWAS results had significant associations for two or more PAGE study sites with consistent direction of effect with a significance threshold of p<0.01 for the same racial/ethnic group, SNP, and phenotype-class. Among results identified for SNPs previously associated with phenotypes such as lipid traits, type 2 diabetes, and body mass index, 52 replicated previously published genotype-phenotype associations, 26 represented phenotypes closely related to previously known genotype-phenotype associations, and 33 represented potentially novel genotype-phenotype associations with pleiotropic effects. The majority of the potentially novel results were for single PheWAS phenotype-classes, for example, for CDKN2A/B rs1333049 (previously associated with type 2 diabetes in EA) a PheWAS association was identified for hemoglobin levels in AA. Of note, however, GALNT2 rs2144300 (previously associated with high-density lipoprotein cholesterol levels in EA) had multiple potentially novel PheWAS associations, with hypertension related phenotypes in AA and with serum calcium levels and coronary artery disease phenotypes in EA. PheWAS identifies associations for hypothesis generation and exploration of the genetic architecture of complex traits.

Dataset Information

Population Architecture using Genomics and Epidemiology (PAGE)

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets