Project description:Population-based genome wide association studies have identified a locus at 9p22.2 associated with ovarian cancer risk, which also modifies ovarian cancer risk in BRCA1 and BRCA2 mutation carriers. We conducted fine-scale mapping at 9p22.2 to identify potential causal variants in BRCA1 and BRCA2 mutation carriers. Genotype data were available for 15,252 (2,462 ovarian cancer cases) BRCA1 and 8,211 (631 ovarian cancer cases) BRCA2 mutation carriers. Following genotype imputation, ovarian cancer associations were assessed for 4,873 and 5,020 SNPs in BRCA1 and BRCA 2 mutation carriers respectively, within a retrospective cohort analytical framework. In BRCA1 mutation carriers one set of eight correlated candidate causal variants for ovarian cancer risk modification was identified (top SNP rs10124837, HR: 0.73, 95%CI: 0.68 to 0.79, p-value 2× 10-16). These variants were located up to 20 kb upstream of BNC2. In BRCA2 mutation carriers one region, up to 45 kb upstream of BNC2, and containing 100 correlated SNPs was identified as candidate causal (top SNP rs62543585, HR: 0.69, 95%CI: 0.59 to 0.80, p-value 1.0 × 10-6). The candidate causal in BRCA1 mutation carriers did not include the strongest associated variant at this locus in the general population. In sum, we identified a set of candidate causal variants in a region that encompasses the BNC2 transcription start site. The ovarian cancer association at 9p22.2 may be mediated by different variants in BRCA1 mutation carriers and in the general population. Thus, potentially different mechanisms may underlie ovarian cancer risk for mutation carriers and the general population.
Project description:Prostate cancer is a polygenic disease with a large heritable component. A number of common, low-penetrance prostate cancer risk loci have been identified through GWAS. Here we apply the Bayesian multivariate variable selection algorithm JAM to fine-map 84 prostate cancer susceptibility loci, using summary data from a large European ancestry meta-analysis. We observe evidence for multiple independent signals at 12 regions and 99 risk signals overall. Only 15 original GWAS tag SNPs remain among the catalogue of candidate variants identified; the remainder are replaced by more likely candidates. Biological annotation of our credible set of variants indicates significant enrichment within promoter and enhancer elements, and transcription factor-binding sites, including AR, ERG and FOXA1. In 40 regions at least one variant is colocalised with an eQTL in prostate cancer tissue. The refined set of candidate variants substantially increase the proportion of familial relative risk explained by these known susceptibility regions, which highlights the importance of fine-mapping studies and has implications for clinical risk profiling.
Project description:Migraine is a highly prevalent neurovascular disorder for which genome-wide association studies (GWAS) have identified over one hundred risk loci, yet the causal variants and genes remain mostly unknown. Here, we meta-analyzed three migraine GWAS including 98,374 cases and 869,160 controls and identified 122 independent risk loci of which 35 were new. Fine-mapping of a meta-analysis is challenging because some variants may be missing from some participating studies and accurate linkage disequilibrium (LD) information of the variants is often not available. Here, using the exact in-sample LD, we first investigated which statistics could reliably capture the quality of fine-mapping when only reference LD was available. We observed that the posterior expected number of causal variants best distinguished between the high- and low-quality results. Next, we performed fine-mapping for 102 autosomal risk regions using FINEMAP. We produced high-quality fine-mapping for 93 regions and defined 181 distinct credible sets. Among the high-quality credible sets were 7 variants with very high posterior inclusion probability (PIP > 0.9) and 2 missense variants with PIP > 0.5 (rs6330 in NGF and rs1133400 in INPP5A). For 35 association signals, we managed to narrow down the set of potential risk variants to at most 5 variants.
Project description:Recent genome-wide association studies have identified 78 loci associated with Parkinson's disease susceptibility but the underlying mechanisms remain largely unclear. To identify likely causal variants for disease risk, we fine-mapped these Parkinson's-associated loci using four different fine-mapping methods. We then integrated multi-assay cell type-specific epigenomic profiles to pinpoint the likely mechanism of action of each variant, allowing us to identify Consensus single nucleotide polymorphism (SNPs) that disrupt LRRK2 and FCGR2A regulatory elements in microglia, an MBNL2 enhancer in oligodendrocytes, and a DYRK1A enhancer in neurons. This genome-wide functional fine-mapping investigation of Parkinson's disease substantially advances our understanding of the causal mechanisms underlying this complex disease while avoiding focus on spurious, non-causal mechanisms. Together, these results provide a robust, comprehensive list of the likely causal variants, genes and cell-types underlying Parkinson's disease risk as demonstrated by consistently greater enrichment of our fine-mapped SNPs relative to lead GWAS SNPs across independent functional impact annotations. In addition, our approach prioritized an average of 3/85 variants per locus as putatively causal, making downstream experimental studies both more tractable and more likely to yield disease-relevant, actionable results. Large-scale studies comparing individuals with Parkinson's disease to age-matched controls have identified many regions of the genome associated with the disease. However, there is widespread correlation between different parts of the genome, making it difficult to tell which genetic variants cause Parkinson's and which are simply co-inherited with causal variants. We therefore applied a suite of statistical models to identify the most likely causal genetic variants (i.e. fine-mapping). We then linked these genetic variants with epigenomic and gene expression signatures across a wide variety of tissues and cell types to identify how these variants cause disease. Therefore, this study provides a comprehensive and robust list of cellular and molecular mechanisms that may serve as targets in the development of more effective Parkinson's therapeutics.
Project description:Advancing from statistical associations of complex traits with genetic markers to understanding the functional genetic variants that influence traits is often a complex process. Fine-mapping can select and prioritize genetic variants for further study, yet the multitude of analytical strategies and study designs makes it challenging to choose an optimal approach. We review the strengths and weaknesses of different fine-mapping approaches, emphasizing the main factors that affect performance. Topics include interpreting results from genome-wide association studies (GWAS), the role of linkage disequilibrium, statistical fine-mapping approaches, trans-ethnic studies, genomic annotation and data integration, and other analysis and design issues.
Project description:IntroductionLarge-scale genome-wide association studies (GWASs) have identified 97 chromosomal loci associated with increased body mass index in population-based studies on adults. One of these SNPs, rs7359397, tags a large region (approx. 1MB) with high linkage disequilibrium (r2>0.7), which comprises five genes (SH2B1, APOBR, sulfotransferases: SULT1A1 and SULT1A2, TUFM). We had previously described a rare mutation in SH2B1 solely identified in extremely obese individuals but not in lean controls.MethodsThe coding regions of the genes APOBR, SULT1A1, SULT1A2, and TUFM were screened for mutations (dHPLC, SSCP, Sanger re-sequencing) in 95 extremely obese children and adolescents. Detected non-synonymous variants were genotyped (TaqMan SNP Genotyping, MALDI TOF, PCR-RFLP) in independent large study groups (up to 3,210 extremely obese/overweight cases, 485 lean controls and 615 obesity trios). In silico tools were used for the prediction of potential functional effects of detected variants.ResultsExcept for TUFM we detected non-synonymous variants in all screened genes. Two polymorphisms rs180743 (APOBR p.Pro428Ala) and rs3833080 (APOBR p.Gly369_Asp370del9) showed nominal association to (extreme) obesity (uncorrected p = 0.003 and p = 0.002, respectively). In silico analyses predicted a functional implication for rs180743 (APOBR p.Pro428Ala). Both APOBR variants are located in the repetitive region with unknown function.ConclusionVariants in APOBR contributed as strongly as variants in SH2B1 to the association with extreme obesity in the chromosomal region chr16p11.2. In silico analyses implied no functional effect of several of the detected variants. Further in vitro or in vivo analyses on the functional implications of the obesity associated variants are warranted.
Project description:Docking the tails of lambs in long-tailed sheep breeds is a common practice worldwide. But this practice is associated with pain. Breeding for a shorter tail could offer an alternative. Therefore, this study aimed to analyze the natural tail length variation in the Merinolandschaf and to identify causal alleles for the short tail phenotype segregating within long-tailed breeds. We used SNP-based association analysis and haplotype-based mapping in 362 genotyped (Illumina OvineSNP50) and phenotyped Merinolandschaf lambs. Genome-wide significant regions were capture sequenced in 48 lambs and comparatively analyzed in various long and short-tailed sheep breeds and wild sheep subspecies. Here we show a SNP located in the first exon of HOXB13 and a SINE element located in the promotor of HOXB13 as promising candidates. These results enable more precise breeding towards shorter tails, improve animal welfare by amplification of ancestral alleles and contribute to a better understanding of differential embryonic development.
Project description:Fine-mapping of interesting loci discovered by genome-wide association study (GWAS) is mandatory to pinpoint causal variants. Traditionally, this fine-mapping is completed through increasing the genotyping density at candidate loci, for which imputation is the current standard approach. Although imputation is a useful technique, it has a number of limitations that impede accuracy. In this work, we describe the development of a precise and cost-effective Nanopore sequencing-based pipeline that provides comprehensive and accurate information at candidate loci to identify potential causal single-nucleotide polymorphisms (SNPs). We demonstrate the utility of this technique via the fine-mapping of a GWAS positive hit comprising a synonymous SNP that is associated with doxorubicin-induced cardiotoxicity. In this work, we provide a proof of principle for the application of Nanopore sequencing in post-GWAS fine-mapping and pinpointing of potential causal SNPs with a minimal cost of just ~$10/100 kb/sample.
Project description:Corneal resistance factor (CRF) is altered during corneal diseases progression. Genome-wide-association studies (GWAS) indicated potential CRF and disease genetics overlap. Here, we characterise 135 CRF loci following GWAS in 76029 UK Biobank participants. Enrichment of extra-cellular matrix gene-sets, genetic correlation with corneal thickness (70% (SE = 5%)), reported keratoconus risk variants at 13 loci, all support relevance to corneal stroma biology. Fine-mapping identifies a subset of 55 highly likely causal variants, 91% of which are non-coding. Genomic features enrichments, using all associated variants, also indicate prominent regulatory causal role. We newly established open chromatin landscapes in two widely-used human cornea immortalised cell lines using ATAC-seq. Variants associated with CRF were significantly enriched in regulatory regions from the corneal stroma-derived cell line and enrichment increases to over 5 fold for variants prioritised by fine-mapping-including at GAS7, SMAD3 and COL6A1 loci. Our analysis generates many hypotheses for future functional validation of aetiological mechanisms.
Project description:Common variants in the hepatocyte nuclear factor 1 homeobox B (HNF1B) gene are associated with the risk of Type II diabetes and multiple cancers. Evidence to date indicates that cancer risk may be mediated via genetic or epigenetic effects on HNF1B gene expression. We previously found single-nucleotide polymorphisms (SNPs) at the HNF1B locus to be associated with endometrial cancer, and now report extensive fine-mapping and in silico and laboratory analyses of this locus. Analysis of 1184 genotyped and imputed SNPs in 6608 Caucasian cases and 37 925 controls, and 895 Asian cases and 1968 controls, revealed the best signal of association for SNP rs11263763 (P = 8.4 × 10(-14), odds ratio = 0.86, 95% confidence interval = 0.82-0.89), located within HNF1B intron 1. Haplotype analysis and conditional analyses provide no evidence of further independent endometrial cancer risk variants at this locus. SNP rs11263763 genotype was associated with HNF1B mRNA expression but not with HNF1B methylation in endometrial tumor samples from The Cancer Genome Atlas. Genetic analyses prioritized rs11263763 and four other SNPs in high-to-moderate linkage disequilibrium as the most likely causal SNPs. Three of these SNPs map to the extended HNF1B promoter based on chromatin marks extending from the minimal promoter region. Reporter assays demonstrated that this extended region reduces activity in combination with the minimal HNF1B promoter, and that the minor alleles of rs11263763 or rs8064454 are associated with decreased HNF1B promoter activity. Our findings provide evidence for a single signal associated with endometrial cancer risk at the HNF1B locus, and that risk is likely mediated via altered HNF1B gene expression.