Quantitative trait loci mapped for TCF21 binding, chromatin accessibility and chromosomal looping in coronary artery smooth muscle cells reveal molecular mechanisms of coronary disease loci [array]
Ontology highlight
ABSTRACT: To better understand the fundamental molecular mechanisms that contribute to complex human diseases such as coronary artery disease (CAD), we have created a catalog of genetic variants associated with three stages of transcriptional cis-regulation in primary human coronary artery vascular smooth muscle cells. To this end, we used a pooling approach to map quantitative trait locus associations (QTLs) for TCF21 binding (ChIPseq), chromatin accessibility (ATACseq), and chromosomal looping (HiC). We find significant overlap of these QTLs, and several analyses indicate their relationship to smooth muscle specific genes, the binding of smooth muscle transcription factors, and enrichment in CAD-associated loci. These QTLs are extensively validated and allele-specific chromatin looping at the FN1 disease locus is shown to be mediated by activation of the CAD-associated TGFb1 pathway. In sum, these results uncover thousands of loci affecting cis-regulation in a key cell type for CAD, including many that may contribute to CAD risk.
Project description:To better understand the fundamental molecular mechanisms that contribute to complex human diseases such as coronary artery disease (CAD), we have created a catalog of genetic variants associated with three stages of transcriptional cis-regulation in primary human coronary artery vascular smooth muscle cells. To this end, we used a pooling approach to map quantitative trait locus associations (QTLs) for TCF21 binding (ChIPseq), chromatin accessibility (ATACseq), and chromosomal looping (HiC). We find significant overlap of these QTLs, and several analyses indicate their relationship to smooth muscle specific genes, the binding of smooth muscle transcription factors, and enrichment in CAD-associated loci. These QTLs are extensively validated and allele-specific chromatin looping at the FN1 disease locus is shown to be mediated by activation of the CAD-associated TGFb1 pathway. In sum, these results uncover thousands of loci affecting cis-regulation in a key cell type for CAD, including many that may contribute to CAD risk.
Project description:To better understand the fundamental molecular mechanisms that contribute to complex human diseases such as coronary artery disease (CAD), we have created a catalog of genetic variants associated with three stages of transcriptional cis-regulation in primary human coronary artery vascular smooth muscle cells. To this end, we used a pooling approach to map quantitative trait locus associations (QTLs) for TCF21 binding (ChIPseq), chromatin accessibility (ATACseq), and chromosomal looping (HiC). We find significant overlap of these QTLs, and several analyses indicate their relationship to smooth muscle specific genes, the binding of smooth muscle transcription factors, and enrichment in CAD-associated loci. These QTLs are extensively validated and allele-specific chromatin looping at the FN1 disease locus is shown to be mediated by activation of the CAD-associated TGFb1 pathway. In sum, these results uncover thousands of loci affecting cis-regulation in a key cell type for CAD, including many that may contribute to CAD risk.
Project description:To better understand the fundamental molecular mechanisms that contribute to complex human diseases such as coronary artery disease (CAD), we have created a catalog of genetic variants associated with three stages of transcriptional cis-regulation in primary human coronary artery vascular smooth muscle cells. To this end, we used a pooling approach to map quantitative trait locus associations (QTLs) for TCF21 binding (ChIPseq), chromatin accessibility (ATACseq), and chromosomal looping (HiC). We find significant overlap of these QTLs, and several analyses indicate their relationship to smooth muscle specific genes, the binding of smooth muscle transcription factors, and enrichment in CAD-associated loci. These QTLs are extensively validated and allele-specific chromatin looping at the FN1 disease locus is shown to be mediated by activation of the CAD-associated TGFb1 pathway. In sum, these results uncover thousands of loci affecting cis-regulation in a key cell type for CAD, including many that may contribute to CAD risk.
Project description:To better understand the fundamental molecular mechanisms that contribute to complex human diseases such as coronary artery disease (CAD), we have created a catalog of genetic variants associated with three stages of transcriptional cis-regulation in primary human coronary artery vascular smooth muscle cells. To this end, we used a pooling approach to map quantitative trait locus associations (QTLs) for TCF21 binding (ChIPseq), chromatin accessibility (ATACseq), and chromosomal looping (HiC). We find significant overlap of these QTLs, and several analyses indicate their relationship to smooth muscle specific genes, the binding of smooth muscle transcription factors, and enrichment in CAD-associated loci. These QTLs are extensively validated and allele-specific chromatin looping at the FN1 disease locus is shown to be mediated by activation of the CAD-associated TGFb1 pathway. In sum, these results uncover thousands of loci affecting cis-regulation in a key cell type for CAD, including many that may contribute to CAD risk.
Project description:Coronary artery disease (CAD) is the leading cause of mortality and morbidity driven by both genetic and environmental risk factors. Meta-analyses of genome-wide association studies (GWAS) have identified multiple single nucleotide polymorphisms (SNPs) associated with CAD and myocardial infarction (MI) susceptibility in multi-ethnic populations. The majority of these variants reside in non-coding regulatory regions and are co-inherited with hundreds of candidate regulatory SNPs. Herein, we use integrative genomic, epigenomic, and transcriptomic fine-mapping in human coronary artery smooth muscle cells (HCASMC) and tissues to identify causal regulatory variation and mechanisms responsible for CAD associations. Using these genome-wide maps we prioritize 65 candidate variants and perform allele-specific binding and expression analyses on 7 top candidates. We validate our findings in two independent cohorts of diseased human arterial expression quantitative trait loci (eQTL), which together demonstrate fundamental links between CAD associations and regulatory function in the appropriate disease context. We performed ATAC-seq, ChIP-seq, and RNA-seq on human coronary artery smooth muscle cells grown in SmGM-2 Smooth Muscle Growth Medium-2 including hEGF, insulin, hFGF-B and FBS, but without antibiotics (Lonza, #CC-3182). For ATAC-seq and RNA-seq we performed stimulations with growth factors (TGF-B1, PDGF-BB, PDGF-DD) versus serum-free control. We conducted two biological replicates for each condition using independent donors. For ATAC-seq experiments, sequencing was completed on an Illumina Hiseq 2500, paired-end 50bp reads. For ChIP-seq we performed immunoprecipitations using H3K27ac (Abcam ab4729). We conducted two biological replicates using HCASMC from independent donors, and also did an IgG control for these studies. For RNA-seq we also conducted two replicates using HCASMC from independent donors. For both ChIP-seq and RNA-seq experiments, sequencing was completed on an Illumina HiSeq 2500, paired-end 100bp reads. We also performed ex-vivo ATAC-seq on frozen tissues (isolated media) from normal and atherosclerotic human coronary arteries, using three independent donors for each. Sequencing was also completed on an Illumina HiSeq 2500, paired end 50bp reads.
Project description:Recent meta-analyses of genome wide association studies (GWAS) have identified approximately 150 loci that are associated with coronary artery disease (CAD). To link the causal genes in these loci to functional transcriptional networks, we have used chromatin immunoprecipitation sequencing (ChIP-Seq) with human coronary artery smooth muscle cells (HCASMC) to investigate the cellular and molecular program downstream of one CAD associated transcription factor, TCF21. Analysis of the TCF21 downstream target genes for enrichment of molecular and cellular annotation terms identified processes relevant to CAD pathophysiology, including growth factor binding (PDGF, VEGF), matrix interactions (“integrin binding,” “cell adhesion”), and smooth muscle contraction (“actin filament-based processes,” “actin cytoskeleton”). Motif searches of peak sequences confirmed the canonical E-box CAGCTG as the likely binding sequence for TCF21, but also identified bZip motifs that coordinate binding of AP1 family transcription factors. Follow-up ChIP-Seq studies verified enriched binding of bZip factors JUN and JUND to TCF21 target loci. Importantly, analysis of the representation of CAD genes among TCF21 target loci showed highly significant enrichment. Further, expression quantitative trait variation mapped to target genes of TCF21 were significantly enriched among variants with low P-values in the GWAS analyses, and single nucleotide polymorphisms within TCF21 peaks were shown to be in linkage disequilibrium with CAD-associated SNPs, suggesting a functional interaction between TCF21 binding and causal variants in other CAD disease loci. Thus, data and analyses presented here provide evidence for a transcriptional regulatory network that links TCF21 function in smooth muscle cells with a number of other CAD-associated genes, and suggest that study of GWAS transcription factors may be a highly useful approach to identifying disease gene interactions and thus pathways that may be relevant to complex disease etiology. We did ChIP-Seq on human coronary artery smooth muscle cells grown in SmGM-2 Smooth Muscle Growth Medium-2 including hEGF, insulin, hFGF-B and FBS, but without antibiotics (Lonza, #CC-3182. We did immunoprecipitations with two antibodies (Sigma HPA013189 and Abcam 49475). We conducted two biological replicates for each antibody, and also did an IgG control for these studies. For these experiments, sequencing was done on an Illumina GAII, single end reads. In separate set of experiments we performed ChIP-Seq for JUN (Ab sc-1694) and JUND (Ab sc-74) with the same HCASMC cells under the same culture conditions. For these studies we did one replicate for each antibody, and also included an IgG control. Sequencing for the JUN and JUND studies was paired ends, on an Illumina HiSeq machine.
Project description:Coronary artery disease (CAD) is the leading cause of mortality and morbidity driven by both genetic and environmental risk factors. Meta-analyses of genome-wide association studies (GWAS) have identified multiple single nucleotide polymorphisms (SNPs) associated with CAD and myocardial infarction (MI) susceptibility in multi-ethnic populations. The majority of these variants reside in non-coding regulatory regions and are co-inherited with hundreds of candidate regulatory SNPs. Herein, we use integrative genomic, epigenomic, and transcriptomic fine-mapping in human coronary artery smooth muscle cells (HCASMC) and tissues to identify causal regulatory variation and mechanisms responsible for CAD associations. Using these genome-wide maps we prioritize 65 candidate variants and perform allele-specific binding and expression analyses on 7 top candidates. We validate our findings in two independent cohorts of diseased human arterial expression quantitative trait loci (eQTL), which together demonstrate fundamental links between CAD associations and regulatory function in the appropriate disease context.
Project description:Coronary artery disease (CAD) is the leading cause of death globally. Genome-wide association studies (GWAS) have identified more than 130 independent loci that influence CAD risk, most of which reside in non-coding regions of the genome. To interpret these loci, we generated transcriptome and whole-genome datasets using human coronary artery smooth muscle cells (HCASMC) from 52 unrelated donors, as well as ATAC-seq epigenomic datasets on a subset of 8 donors. Through systematic comparison with publicly available datasets from GTEx and ENCODE projects, we identify transcriptomic, epigenetic, and genetic regulatory mechanisms specific to HCASMC. We validate the relevance of HCASMC to CAD risk using transcriptomic and epigenomic level analyses. By jointly modeling eQTL and GWAS datasets, we identified five genes (SIPA1, TCF21, SMAD3, FES, and PDGFRA) that modulate CAD risk through HCASMC, all of which have biologically relevant functional roles in vascular remodeling. Comparison with GTEx data suggests that SIPA1 appears to influence CAD risk predominantly through HCASMC, while other annotated genes may have multiple cell targets. Together, these results provide new biological insights into the regulation of a critical vascular cell type associated with CAD in human populations.
Project description:Coronary artery disease (CAD) is a complex inflammatory disease involving genetic influences across several cell types. Genome-wide association studies (GWAS) have identified over 170 loci associated with CAD, where the majority of risk variants reside in noncoding DNA sequences impacting cis-regulatory elements (CREs). Here, we applied single-cell ATAC-seq to profile 28,316 cells across coronary artery segments from 41 patients with varying stages of CAD, which revealed 14 distinct cellular clusters. We mapped ~320,000 accessible sites across all cells, identified cell type-specific elements, transcription factors, and prioritized functional CAD risk variants via quantitative trait locus and sequence-based predictive modeling. We identified a number of candidate mechanisms for smooth muscle cell transition states and identified putative binding sites for risk variants. We further employed CRE to gene linkage to nominate disease-associated key driver transcription factors such as PRDM16 and TBX2. This single cell atlas provides a critical step towards interpreting cis-regulatory mechanisms in the vessel wall across the continuum of CAD risk.
Project description:Recent meta-analyses of genome wide association studies (GWAS) have identified approximately 150 loci that are associated with coronary artery disease (CAD). To link the causal genes in these loci to functional transcriptional networks, we have used chromatin immunoprecipitation sequencing (ChIP-Seq) with human coronary artery smooth muscle cells (HCASMC) to investigate the cellular and molecular program downstream of one CAD associated transcription factor, TCF21. Analysis of the TCF21 downstream target genes for enrichment of molecular and cellular annotation terms identified processes relevant to CAD pathophysiology, including growth factor binding (PDGF, VEGF), matrix interactions (“integrin binding,” “cell adhesion”), and smooth muscle contraction (“actin filament-based processes,” “actin cytoskeleton”). Motif searches of peak sequences confirmed the canonical E-box CAGCTG as the likely binding sequence for TCF21, but also identified bZip motifs that coordinate binding of AP1 family transcription factors. Follow-up ChIP-Seq studies verified enriched binding of bZip factors JUN and JUND to TCF21 target loci. Importantly, analysis of the representation of CAD genes among TCF21 target loci showed highly significant enrichment. Further, expression quantitative trait variation mapped to target genes of TCF21 were significantly enriched among variants with low P-values in the GWAS analyses, and single nucleotide polymorphisms within TCF21 peaks were shown to be in linkage disequilibrium with CAD-associated SNPs, suggesting a functional interaction between TCF21 binding and causal variants in other CAD disease loci. Thus, data and analyses presented here provide evidence for a transcriptional regulatory network that links TCF21 function in smooth muscle cells with a number of other CAD-associated genes, and suggest that study of GWAS transcription factors may be a highly useful approach to identifying disease gene interactions and thus pathways that may be relevant to complex disease etiology.