Project description:Human longevity is heritable, but genome-wide association (GWA) studies have had limited success. Here, we perform two meta-analyses of GWA studies using a rigorous longevity phenotype definition, including 11,262/3,484 cases surviving at or beyond the age corresponding to the 90th/99th survival percentile, respectively, and 25,483 controls whose age at death or at last contact was at or below the age corresponding to the 60th survival percentile. Consistent with previous reports, rs429358 (apolipoprotein E (ApoE) ε4) is associated with lower odds of surviving to the 90th and 99th percentile age, while rs7412 (ApoE ε2) shows the opposite. Moreover, rs7676745, located near GPR78, associates with lower odds of surviving to the 90th percentile age. Gene-level association analysis reveals a role for tissue-specific expression of multiple genes in longevity. Finally, genetic correlation of the longevity GWA results with those of several disease-related phenotypes points to a shared genetic architecture between health and longevity.
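The percentile-based case/control definition above can be sketched as a small classifier. This is only an illustration: it assumes the ages at the 60th, 90th and 99th survival percentiles for a person's birth cohort and sex have already been looked up from a life table, and the function name `classify_longevity` and the ages in the usage lines are hypothetical, not values from the study.

```python
from typing import Optional


def classify_longevity(age: float, age_p60: float, age_p90: float,
                       age_p99: float) -> Optional[str]:
    """Label one individual from age at death or at last contact.

    age_p60, age_p90 and age_p99 are the ages at the 60th, 90th and 99th
    survival percentile for the person's cohort and sex (assumed inputs).
    """
    if age >= age_p99:
        # Surviving to the 99th percentile age also satisfies the
        # 90th percentile case definition.
        return "case_90_and_99"
    if age >= age_p90:
        return "case_90"
    if age <= age_p60:
        return "control"
    # Between the 60th and 90th percentile ages: neither case nor control.
    return None


# Hypothetical thresholds, for illustration only.
print(classify_longevity(101, age_p60=75, age_p90=90, age_p99=100))
print(classify_longevity(70, age_p60=75, age_p90=90, age_p99=100))
```

Returning `None` for the intermediate ages makes the exclusion of that group explicit rather than silently assigning it to either arm.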
Project description:Background: The search for statistically significant relationships between molecular markers and outcomes is challenging when dealing with high-dimensional, noisy and collinear multivariate omics data, such as metabolomic profiles. Permutation procedures allow for the estimation of adjusted significance levels without assuming independence among metabolomic variables. Nevertheless, the complex non-normal structure of metabolic profiles and outcomes may bias the permutation results, leading to overly conservative threshold estimates, i.e., lower than those from a Bonferroni or Sidak correction. Methods: Within a univariate permutation procedure, we employ parametric simulation methods based on the multivariate (log-)Normal distribution to obtain adjusted significance levels which are consistent across different outcomes while effectively controlling the type I error rate. Next, we derive an alternative closed-form expression for the estimation of the number of non-redundant metabolic variates based on the spectral decomposition of their correlation matrix. The performance of the method is tested for different model parametrizations and across a wide range of correlation levels of the variates using synthetic and real data sets. Results: Both the permutation-based formulation and the more practical closed-form expression are found to give an effective indication of the number of independent metabolic effects exhibited by the system, while guaranteeing that the derived adjusted threshold is stable across outcome measures with diverse properties.
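The idea of estimating a number of non-redundant variates from the spectrum of the correlation matrix, and then deriving an adjusted threshold, can be illustrated with a short sketch. The abstract does not give the authors' closed-form expression, so the code below substitutes a different, well-known estimator based on the variance of the eigenvalues (of the kind proposed by Nyholt) and combines it with the standard Sidak formula; both function names are hypothetical.

```python
def effective_number_of_variates(corr):
    """Estimate M_eff = 1 + (M - 1) * (1 - Var(eigenvalues) / M).

    NOTE: a stand-in estimator, not the expression derived in the study.
    For a symmetric correlation matrix the eigenvalues sum to M and their
    squares sum to the sum of squared matrix entries, so the eigenvalue
    variance is available without an explicit eigendecomposition.
    """
    m = len(corr)
    if m == 1:
        return 1.0
    sum_sq = sum(r * r for row in corr for r in row)  # sum of squared eigenvalues
    var_eig = (sum_sq - m) / (m - 1)                  # sample variance; mean eigenvalue is 1
    return 1.0 + (m - 1) * (1.0 - var_eig / m)


def sidak_threshold(alpha, n_tests):
    """Per-test significance level keeping the family-wise error rate at alpha."""
    return 1.0 - (1.0 - alpha) ** (1.0 / n_tests)


# Three variates, two of them strongly correlated (made-up values).
corr = [
    [1.0, 0.9, 0.0],
    [0.9, 1.0, 0.0],
    [0.0, 0.0, 1.0],
]
m_eff = effective_number_of_variates(corr)
print(m_eff, sidak_threshold(0.05, m_eff))
```

For uncorrelated variates the estimate equals the number of variates, and for perfectly correlated variates it collapses to 1, which is the behaviour any such estimator is expected to show at the two extremes.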
Project description:Despite enormous research efforts, the genetic component of longevity has remained largely elusive. The investigation of common variants, mainly located in intronic or regulatory regions, has yielded little new information on the heritability of the phenotype. Here, we performed a chip-based exome-wide association study investigating 62 488 common and rare coding variants in 1248 German long-lived individuals, including 599 centenarians, and 6941 younger controls (age < 60 years). In a single-variant analysis, we observed an exome-wide significant association between rs1046896 in the gene fructosamine-3-kinase-related-protein (FN3KRP) and longevity. Notably, we found the longevity allele C of rs1046896 to be associated with increased FN3KRP expression in whole blood; a database look-up confirmed this effect for various other human tissues. A gene-based analysis, in which potential cumulative effects of common and rare variants were considered, yielded the gene phosphoglycolate phosphatase (PGP) as another potential longevity gene, though no single variant in PGP reached the discovery p-value (1 × 10⁻⁴). Furthermore, we validated the previously reported longevity locus cyclin-dependent kinase inhibitor 2B antisense RNA 1 (CDKN2B-AS1). Replication of our results in a French longevity cohort was only successful for rs1063192 in CDKN2B-AS1. In conclusion, we identified 2 new potential candidate longevity genes, FN3KRP and PGP, which may influence the phenotype through their role in metabolic processes, that is, the reverse glycation of proteins (FN3KRP) and the control of glycerol-3-phosphate levels (PGP).
Project description:Genome-wide association studies (GWAS) of lung cancer in Asian never-smoking women have previously identified six susceptibility loci associated with lung cancer risk. To further discover new susceptibility loci, we imputed data from four GWAS of Asian non-smoking female lung cancer (6877 cases and 6277 controls) using the 1000 Genomes Project (Phase 1 Release 3) data as the reference and genotyped additional samples (5878 cases and 7046 controls) for possible replication. In our meta-analysis, three new loci achieved genome-wide significance, marked by single nucleotide polymorphism (SNP) rs7741164 at 6p21.1 (per-allele odds ratio (OR) = 1.17; P = 5.8 × 10⁻¹³), rs72658409 at 9p21.3 (per-allele OR = 0.77; P = 1.41 × 10⁻¹⁰) and rs11610143 at 12q13.13 (per-allele OR = 0.89; P = 4.96 × 10⁻⁹). These findings identified new genetic susceptibility alleles for lung cancer in never-smoking women in Asia and merit follow-up to understand their biological underpinnings.
Project description:The international Testicular Cancer Consortium (TECAC) combined five published genome-wide association studies of testicular germ cell tumor (TGCT; 3,558 cases and 13,970 controls) to identify new susceptibility loci. We conducted a fixed-effects meta-analysis, including, to our knowledge, the first analysis of the X chromosome. Eight new loci mapping to 2q14.2, 3q26.2, 4q35.2, 7q36.3, 10q26.13, 15q21.3, 15q22.31, and Xq28 achieved genome-wide significance (P < 5 × 10⁻⁸). Most loci harbor biologically plausible candidate genes. We refined previously reported associations at 9p24.3 and 19p12 by identifying one and three additional independent SNPs, respectively. In aggregate, the 39 independent markers identified to date explain 37% of father-to-son familial risk, 8% of which can be attributed to the 12 new signals reported here. Our findings substantially increase the number of known TGCT susceptibility alleles, move the field closer to a comprehensive understanding of the underlying genetic architecture of TGCT, and provide further clues to the etiology of TGCT.
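A fixed-effects meta-analysis of per-study SNP estimates is commonly computed with inverse-variance weights. The abstract does not state which weighting scheme was used, so the sketch below is an assumption on that point; the function name `fixed_effects_meta` and the input values are hypothetical.

```python
import math


def fixed_effects_meta(betas, standard_errors):
    """Inverse-variance weighted fixed-effects pooling for one SNP.

    betas: per-study log odds ratios; standard_errors: their standard errors.
    Returns (pooled_beta, pooled_se, z, two_sided_p).
    """
    if len(betas) != len(standard_errors) or not betas:
        raise ValueError("need one standard error per effect estimate")
    weights = [1.0 / se ** 2 for se in standard_errors]
    total_weight = sum(weights)
    pooled_beta = sum(w * b for w, b in zip(weights, betas)) / total_weight
    pooled_se = math.sqrt(1.0 / total_weight)
    z = pooled_beta / pooled_se
    # Two-sided p-value under a standard normal null distribution.
    p = math.erfc(abs(z) / math.sqrt(2.0))
    return pooled_beta, pooled_se, z, p


# Made-up estimates from five studies, for illustration only.
betas = [0.21, 0.18, 0.25, 0.15, 0.20]
ses = [0.05, 0.06, 0.08, 0.07, 0.04]
beta, se, z, p = fixed_effects_meta(betas, ses)
print(beta, se, p, p < 5e-8)
```

Studies with smaller standard errors dominate the pooled estimate, which is why the combined standard error is always smaller than that of any single contributing study.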
Project description:Background: While genome-wide association studies (GWAS) of multiple myeloma (MM) have identified variants at 23 regions influencing risk, the genes underlying these associations are largely unknown. To identify candidate causal genes at these regions and search for novel risk regions, we performed a multi-tissue transcriptome-wide association study (TWAS). Results: GWAS data on 7319 MM cases and 234,385 controls were integrated with Genotype-Tissue Expression Project (GTEx) data assayed in 48 tissues (sample sizes, N = 80-491), including lymphocyte cell lines and whole blood, to predict gene expression. We identified 108 genes at 13 independent regions associated with MM risk, all of which were within 1 Mb of known MM GWAS risk variants. Of these, 94 genes, located in eight regions, had not previously been considered candidate genes for their respective loci. Conclusions: Our findings highlight the value of leveraging expression data from multiple tissues to identify candidate genes responsible for GWAS associations which provide insight into MM tumorigenesis. Among the genes identified, a number have plausible roles in MM biology, notably APOBEC3C, APOBEC3H, APOBEC3D, APOBEC3F, APOBEC3G, or have been previously implicated in other malignancies. The genes identified in this TWAS can be explored for follow-up and validation to further understand their role in MM biology.
Project description:EPIC array data were generated from 2 MDD case-control cohorts. EWAS was performed in each cohort, followed by meta-analysis across the 2 cohorts. Cohort 1: A total of 191 blood samples from 112 patients with MDD were collected up to the interim analysis (wave 1 samples) of the observational clinical study OBSERVEMDD0001 (ClinicalTrials.gov Identifier: NCT02489305) and compared with 32 healthy controls. Cohort 2: The MDD cases (N = 359) were drawn from the Molecular Biomarkers of Antidepressant Response study and compared with 68 healthy controls.
Project description:BACKGROUND: C-reactive protein (CRP) is a heritable marker of chronic inflammation that is strongly associated with cardiovascular disease. We sought to identify genetic variants that are associated with CRP levels. METHODS AND RESULTS: We performed a genome-wide association analysis of CRP in 66 185 participants from 15 population-based studies. We sought replication for the genome-wide significant and suggestive loci in a replication panel comprising 16 540 individuals from 10 independent studies. We found 18 genome-wide significant loci, and we provided evidence of replication for 8 of them. Our results confirm 7 previously known loci and introduce 11 novel loci that are implicated in pathways related to the metabolic syndrome (APOC1, HNF1A, LEPR, GCKR, HNF4A, and PTPN2) or the immune system (CRP, IL6R, NLRP3, IL1F10, and IRF1) or that reside in regions previously not known to play a role in chronic inflammation (PPP1R3B, SALL1, PABPC4, ASCL1, RORA, and BCL7B). We found a significant interaction of body mass index with LEPR (P < 2.9 × 10⁻⁶). A weighted genetic risk score that was developed to summarize the effect of risk alleles was strongly associated with CRP levels and explained ≈5% of the trait variance; however, there was no evidence for these genetic variants explaining the association of CRP with coronary heart disease. CONCLUSIONS: We identified 18 loci that were associated with CRP levels. Our study highlights immune response and metabolic regulatory pathways involved in the regulation of chronic inflammation.
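A weighted genetic risk score of the kind mentioned above is usually the sum of risk-allele counts, each multiplied by its per-allele effect size. The sketch below shows that generic construction; the abstract does not specify how the score was built, so the exact weighting is an assumption, and the function name `weighted_grs`, the SNP labels and the weights are hypothetical.

```python
def weighted_grs(dosages, effect_sizes):
    """Weighted genetic risk score for one individual.

    dosages: risk-allele count (0 to 2, possibly fractional if imputed) per SNP.
    effect_sizes: per-allele effect on the trait per SNP, used as weights.
    """
    missing = set(effect_sizes) - set(dosages)
    if missing:
        raise KeyError(f"no genotype for: {sorted(missing)}")
    return sum(effect_sizes[snp] * dosages[snp] for snp in effect_sizes)


# Hypothetical SNP labels and weights, for illustration only.
effect_sizes = {"snp_a": 0.10, "snp_b": 0.05, "snp_c": 0.20}
person = {"snp_a": 2, "snp_b": 1, "snp_c": 0}
print(weighted_grs(person, effect_sizes))
```

Keying both inputs by SNP identifier, rather than relying on list order, avoids silently pairing a genotype with the wrong weight.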
Project description:Allergen-specific immunoglobulin E (present in allergic sensitization) has a central role in the pathogenesis of allergic disease. We performed the first large-scale genome-wide association study (GWAS) of allergic sensitization in 5,789 affected individuals and 10,056 controls and followed up the top SNP at each of 26 loci in 6,114 affected individuals and 9,920 controls. We increased the number of susceptibility loci with genome-wide significant association with allergic sensitization from three to ten, including SNPs in or near TLR6, C11orf30, STAT6, SLC25A46, HLA-DQB1, IL1RL1, LPP, MYC, IL2 and HLA-B. All the top SNPs were associated with allergic symptoms in an independent study. Risk-associated variants at these ten loci were estimated to account for at least 25% of allergic sensitization and allergic rhinitis. Understanding the molecular mechanisms underlying these associations may provide new insights into the etiology of allergic disease.
Project description:The search for the genetic determinants of extreme human longevity has been challenged by the phenotype's rarity and its nonspecific definition by investigators. To address these issues, we established a consortium of four studies of extreme longevity that contributed 2,070 individuals who survived to the oldest one percentile of survival for the 1900 U.S. birth year cohort. We conducted various analyses to discover longevity-associated variants (LAVs) and characterized those LAVs that differentiate survival to extreme age at death (eSAVs) from those LAVs that become more frequent in centenarians because of mortality selection (e.g., survival to younger years). The analyses identified new rare variants in chromosomes 4 and 7 associated with extreme survival and with reduced risk for cardiovascular disease and Alzheimer's disease. The results confirm the importance of studying truly rare survival to discover those combinations of common and rare variants associated with extreme longevity and longer health span.