Following-up genome-wide association study signals: lessons learned from Cohorts for Heart and Aging Research in Genomic Epidemiology (CHARGE) Consortium Targeted Sequencing Study.
Following-up genome-wide association study signals: lessons learned from Cohorts for Heart and Aging Research in Genomic Epidemiology (CHARGE) Consortium Targeted Sequencing Study.
Project description:Venous thromboembolism (VTE) is a common, heritable disease resulting in high rates of hospitalization and mortality. Yet few associations between VTE and genetic variants, all in the coagulation pathway, have been established. To identify additional genetic determinants of VTE, we conducted a two-stage genome-wide association study (GWAS) among individuals of European ancestry in the extended cohorts for heart and aging research in genomic epidemiology (CHARGE) VTE consortium. The discovery GWAS comprised 1,618 incident VTE cases out of 44,499 participants from six community-based studies. Genotypes for genome-wide single-nucleotide polymorphisms (SNPs) were imputed to approximately 2.5 million SNPs in HapMap and association with VTE assessed using study-design appropriate regression methods. Meta-analysis of these results identified two known loci, in F5 and ABO. Top 1,047 tag SNPs (P ≤ 0.0016) from the discovery GWAS were tested for association in an additional 3,231 cases and 3,536 controls from three case-control studies. In the combined data from these two stages, additional genome-wide significant associations were observed on 4q35 at F11 (top SNP rs4253399, intronic to F11) and on 4q28 at FGG (rs6536024, 9.7 kb from FGG; P < 5.0 × 10(-13) for both). The associations at the FGG locus were not completely explained by previously reported variants. Loci at or near SUSD1 and OTUD7A showed borderline yet novel associations (P < 5.0 × 10(-6) ) and constitute new candidate genes. In conclusion, this large GWAS replicated key genetic associations in F5 and ABO, and confirmed the importance of F11 and FGG loci for VTE. Future studies are warranted to better characterize the associations with F11 and FGG and to replicate the new candidate associations.
Project description:BACKGROUND:The pulmonary function measures of forced expiratory volume in 1 second (FEV1) and its ratio to forced vital capacity (FVC) are used in the diagnosis and monitoring of lung diseases and predict cardiovascular mortality in the general population. Genome-wide association studies (GWASs) have identified numerous loci associated with FEV1 and FEV1/FVC, but the causal variants remain uncertain. We hypothesized that novel or rare variants poorly tagged by GWASs may explain the significant associations between FEV1/FVC and 2 genes: ADAM19 and HTR4. METHODS AND RESULTS:We sequenced ADAM19 and its promoter region along with the ?21-kb portion of HTR4 harboring GWAS single-nucleotide polymorphisms for pulmonary function and analyzed associations with FEV1/FVC among 3983 participants of European ancestry from Cohorts for Heart and Aging Research in Genomic Epidemiology (CHARGE) Consortium. Meta-analysis of common variants in each region identified statistically significant associations (316 tests; P<1.58×10(-4)) with FEV1/FVC for 14 ADAM19 single-nucleotide polymorphisms and 24 HTR4 single-nucleotide polymorphisms. After conditioning on the sentinel GWASs hit in each gene (ADAM19 rs1422795, minor allele frequency=0.33 and HTR4 rs11168048, minor allele frequency=0.40], 1 single-nucleotide polymorphism remained statistically significant (ADAM19 rs13155908, minor allele frequency=0.12; P=1.56×10(-4)). Analysis of rare variants (minor allele frequency <1%) using sequence kernel association test did not identify associations with either region. CONCLUSIONS:Sequencing identified 1 common variant associated with FEV1/FVC independent of the sentinel ADAM19 GWAS hit and supports the original HTR4 GWAS findings. Rare variants do not seem to underlie GWAS associations with pulmonary function for common variants in ADAM19 and HTR4.
Project description:BACKGROUND:Genome-wide association studies have identified thousands of genetic variants that influence a variety of diseases and health-related quantitative traits. However, the causal variants underlying the majority of genetic associations remain unknown. Cohorts for Heart and Aging Research in Genomic Epidemiology (CHARGE) Consortium Targeted Sequencing Study aims to follow up genome-wide association study signals and identify novel associations of the allelic spectrum of identified variants with cardiovascular-related traits. METHODS AND RESULTS:The study included 4231 participants from 3 CHARGE cohorts: the Atherosclerosis Risk in Communities Study, the Cardiovascular Health Study, and the Framingham Heart Study. We used a case-cohort design in which we selected both a random sample of participants and participants with extreme phenotypes for each of 14 traits. We sequenced and analyzed 77 genomic loci, which had previously been associated with ?1 of 14 phenotypes. A total of 52?736 variants were characterized by sequencing and passed our stringent quality control criteria. For common variants (minor allele frequency ?1%), we performed unweighted regression analyses to obtain P values for associations and weighted regression analyses to obtain effect estimates that accounted for the sampling design. For rare variants, we applied 2 approaches: collapsed aggregate statistics and joint analysis of variants using the sequence kernel association test. CONCLUSIONS:We sequenced 77 genomic loci in participants from 3 cohorts. We established a set of filters to identify high-quality variants and implemented statistical and bioinformatics strategies to analyze the sequence data and identify potentially functional variants within genome-wide association study loci.
Project description:BACKGROUND:The primary aim of genome-wide association studies is to identify novel genetic loci associated with interindividual variation in the levels of risk factors, the degree of subclinical disease, or the risk of clinical disease. The requirement for large sample sizes and the importance of replication have served as powerful incentives for scientific collaboration. Methods- The Cohorts for Heart and Aging Research in Genomic Epidemiology Consortium was formed to facilitate genome-wide association studies meta-analyses and replication opportunities among multiple large population-based cohort studies, which collect data in a standardized fashion and represent the preferred method for estimating disease incidence. The design of the Cohorts for Heart and Aging Research in Genomic Epidemiology Consortium includes 5 prospective cohort studies from the United States and Europe: the Age, Gene/Environment Susceptibility-Reykjavik Study, the Atherosclerosis Risk in Communities Study, the Cardiovascular Health Study, the Framingham Heart Study, and the Rotterdam Study. With genome-wide data on a total of about 38 000 individuals, these cohort studies have a large number of health-related phenotypes measured in similar ways. For each harmonized trait, within-cohort genome-wide association study analyses are combined by meta-analysis. A prospective meta-analysis of data from all 5 cohorts, with a properly selected level of genome-wide statistical significance, is a powerful approach to finding genuine phenotypic associations with novel genetic loci. CONCLUSIONS:The Cohorts for Heart and Aging Research in Genomic Epidemiology Consortium and collaborating non-member studies or consortia provide an excellent framework for the identification of the genetic determinants of risk factors, subclinical-disease measures, and clinical events.
Project description:BackgroundAtherosclerosis, the precursor to coronary heart disease and stroke, is characterized by an accumulation of fatty cells in the arterial intimal-medial layers. Common carotid intima media thickness (cIMT) and plaque are subclinical atherosclerosis measures that predict cardiovascular disease events. Previously, genome-wide association studies demonstrated evidence for association with cIMT (SLC17A4) and plaque (PIK3CG).Methods and resultsWe sequenced 120 kb around SLC17A4 (6p22.2) and 251 kb around PIK3CG (7q22.3) among 3669 European ancestry participants from the Atherosclerosis Risk in Communities (ARIC) study, Cardiovascular Health Study (CHS), and Framingham Heart Study (FHS) in Cohorts for Heart and Aging Research in Genomic Epidemiology (CHARGE) Consortium. Primary analyses focused on 438 common variants (minor allele frequency ≥1%), which were independently meta-analyzed. A 3' untranslated region CCDC71L variant (rs2286149), upstream from PIK3CG, was the most significant finding in cIMT (P=0.00033) and plaque (P=0.0004) analyses. A SLC17A4 intronic variant was also associated with cIMT (P=0.008). Both were in low linkage disequilibrium with the genome-wide association study single nucleotide polymorphisms. Gene-based tests including T1 count and sequence kernel association test for rare variants (minor allele frequency <1%) did not yield statistically significant associations. However, we observed nominal associations for rare variants in CCDC71L and SLC17A3 with cIMT and of the entire 7q22 region with plaque (P=0.05).ConclusionsCommon and rare variants in PIK3CG and SLC17A4 regions demonstrated modest association with subclinical atherosclerosis traits. Although not conclusive, these findings may help to understand the genetic architecture of regions previously implicated by genome-wide association studies and identify variants within these regions for further investigation in larger samples.
Project description:BACKGROUND:Genome-wide association studies for body mass index (BMI) previously identified a locus near TMEM18. We conducted targeted sequencing of this region to investigate the role of common, low-frequency, and rare variants influencing BMI. METHODS AND RESULTS:We sequenced TMEM18 and regions downstream of TMEM18 on chromosome 2 in 3976 individuals of European ancestry from 3 community-based cohorts (Atherosclerosis Risk in Communities, Cardiovascular Health Study, and Framingham Heart Study), including 200 adults selected for high BMI. We examined the association between BMI and variants identified in the region from nucleotide position 586?432 to 677?539 (hg18). Rare variants (minor allele frequency, <1%) were analyzed using a burden test and the sequence kernel association test. Results from the 3 cohort studies were meta-analyzed. We estimate that mean BMI is 0.43 kg/m(2) higher for each copy of the G allele of single-nucleotide polymorphism rs7596758 (minor allele frequency, 29%; P=3.46×10(-4)) using a Bonferroni threshold of P<4.6×10(-4). Analyses conditional on previous genome-wide association study single-nucleotide polymorphisms associated with BMI in the region led to attenuation of this signal and uncovered another independent (r(2)<0.2), statistically significant association, rs186019316 (P=2.11×10(-4)). Both rs186019316 and rs7596758 or proxies are located in transcription factor binding regions. No significant association with rare variants was found in either the exons of TMEM18 or the 3' genome-wide association study region. CONCLUSIONS:Targeted sequencing around TMEM18 identified 2 novel BMI variants with possible regulatory function.
Project description:BackgroundStroke, the leading neurologic cause of death and disability, has a substantial genetic component. We previously conducted a genome-wide association study (GWAS) in four prospective studies from the Cohorts for Heart and Aging Research in Genomic Epidemiology (CHARGE) consortium and demonstrated that sequence variants near the NINJ2 gene are associated with incident ischemic stroke. Here, we sought to fine-map functional variants in the region and evaluate the contribution of rare variants to ischemic stroke risk.Methods and resultsWe sequenced 196 kb around NINJ2 on chromosome 12p13 among 3,986 European ancestry participants, including 475 ischemic stroke cases, from the Atherosclerosis Risk in Communities Study, Cardiovascular Health Study, and Framingham Heart Study. Meta-analyses of single-variant tests for 425 common variants (minor allele frequency [MAF] ≥ 1%) confirmed the original GWAS results and identified an independent intronic variant, rs34166160 (MAF = 0.012), most significantly associated with incident ischemic stroke (HR = 1.80, p = 0.0003). Aggregating 278 putatively-functional variants with MAF≤ 1% using count statistics, we observed a nominally statistically significant association, with the burden of rare NINJ2 variants contributing to decreased ischemic stroke incidence (HR = 0.81; p = 0.026).ConclusionCommon and rare variants in the NINJ2 region were nominally associated with incident ischemic stroke among a subset of CHARGE participants. Allelic heterogeneity at this locus, caused by multiple rare, low frequency, and common variants with disparate effects on risk, may explain the difficulties in replicating the original GWAS results. Additional studies that take into account the complex allelic architecture at this locus are needed to confirm these findings.
Project description:BACKGROUND:The cardiac sodium channel SCN5A regulates atrioventricular and ventricular conduction. Genetic variants in this gene are associated with PR and QRS intervals. We sought to characterize further the contribution of rare and common coding variation in SCN5A to cardiac conduction. METHODS AND RESULTS:In Cohorts for Heart and Aging Research in Genomic Epidemiology (CHARGE) Consortium Targeted Sequencing Study, we performed targeted exonic sequencing of SCN5A (n=3699, European ancestry individuals) and identified 4 common (minor allele frequency >1%) and 157 rare variants. Common and rare SCN5A coding variants were examined for association with PR and QRS intervals through meta-analysis of European ancestry participants from CHARGE, National Heart, Lung, and Blood Institute's Exome Sequencing Project (n=607), and the UK10K (n=1275) and by examining Exome Sequencing Project African ancestry participants (n=972). Rare coding SCN5A variants in aggregate were associated with PR interval in European and African ancestry participants (P=1.3×10(-3)). Three common variants were associated with PR and QRS interval duration among European ancestry participants and one among African ancestry participants. These included 2 well-known missense variants: rs1805124 (H558R) was associated with PR and QRS shortening in European ancestry participants (P=6.25×10(-4) and P=5.2×10(-3), respectively) and rs7626962 (S1102Y) was associated with PR shortening in those of African ancestry (P=2.82×10(-3)). Among European ancestry participants, 2 novel synonymous variants, rs1805126 and rs6599230, were associated with cardiac conduction. Our top signal, rs1805126 was associated with PR and QRS lengthening (P=3.35×10(-7) and P=2.69×10(-4), respectively) and rs6599230 was associated with PR shortening (P=2.67×10(-5)). CONCLUSIONS:By sequencing SCN5A, we identified novel common and rare coding variants associated with cardiac conduction.
Project description:BackgroundGenome-wide association studies have identified multiple genomic loci associated with coronary artery disease, but most are common variants in non-coding regions that provide limited information on causal genes and etiology of the disease. To overcome the limited scope that common variants provide, we focused our investigation on low-frequency and rare sequence variations primarily residing in coding regions of the genome.Methods and resultsUsing samples of individuals of European ancestry from ten cohorts within the Cohorts for Heart and Aging Research in Genomic Epidemiology (CHARGE) consortium, both cross-sectional and prospective analyses were conducted to examine associations between genetic variants and myocardial infarction (MI), coronary heart disease (CHD), and all-cause mortality following these events. For prevalent events, a total of 27,349 participants of European ancestry, including 1831 prevalent MI cases and 2518 prevalent CHD cases were used. For incident cases, a total of 55,736 participants of European ancestry were included (3,031 incident MI cases and 5,425 incident CHD cases). There were 1,860 all-cause deaths among the 3,751 MI and CHD cases from six cohorts that contributed to the analysis of all-cause mortality. Single variant and gene-based analyses were performed separately in each cohort and then meta-analyzed for each outcome. A low-frequency intronic variant (rs988583) in PLCL1 was significantly associated with prevalent MI (OR = 1.80, 95% confidence interval: 1.43, 2.27; P = 7.12 × 10-7). We conducted gene-based burden tests for genes with a cumulative minor allele count (cMAC) ≥ 5 and variants with minor allele frequency (MAF) < 5%. TMPRSS5 and LDLRAD1 were significantly associated with prevalent MI and CHD, respectively, and RC3H2 and ANGPTL4 were significantly associated with incident MI and CHD, respectively. No loci were significantly associated with all-cause mortality following a MI or CHD event.ConclusionThis study identified one known locus (ANGPTL4) and four new loci (PLCL1, RC3H2, TMPRSS5, and LDLRAD1) associated with cardiovascular disease risk that warrant further investigation.
Project description:The role that vitamin D plays in pulmonary function remains uncertain. Epidemiological studies reported mixed findings for serum 25-hydroxyvitamin D (25(OH)D)-pulmonary function association. We conducted the largest cross-sectional meta-analysis of the 25(OH)D-pulmonary function association to date, based on nine European ancestry (EA) cohorts (n 22 838) and five African ancestry (AA) cohorts (n 4290) in the Cohorts for Heart and Aging Research in Genomic Epidemiology Consortium. Data were analysed using linear models by cohort and ancestry. Effect modification by smoking status (current/former/never) was tested. Results were combined using fixed-effects meta-analysis. Mean serum 25(OH)D was 68 (sd 29) nmol/l for EA and 49 (sd 21) nmol/l for AA. For each 1 nmol/l higher 25(OH)D, forced expiratory volume in the 1st second (FEV1) was higher by 1·1 ml in EA (95 % CI 0·9, 1·3; P<0·0001) and 1·8 ml (95 % CI 1·1, 2·5; P<0·0001) in AA (P race difference=0·06), and forced vital capacity (FVC) was higher by 1·3 ml in EA (95 % CI 1·0, 1·6; P<0·0001) and 1·5 ml (95 % CI 0·8, 2·3; P=0·0001) in AA (P race difference=0·56). Among EA, the 25(OH)D-FVC association was stronger in smokers: per 1 nmol/l higher 25(OH)D, FVC was higher by 1·7 ml (95 % CI 1·1, 2·3) for current smokers and 1·7 ml (95 % CI 1·2, 2·1) for former smokers, compared with 0·8 ml (95 % CI 0·4, 1·2) for never smokers. In summary, the 25(OH)D associations with FEV1 and FVC were positive in both ancestries. In EA, a stronger association was observed for smokers compared with never smokers, which supports the importance of vitamin D in vulnerable populations.