Atypical modes of CTCF binding facilitate tissue-specific and neuronal activity-dependent gene expression states [SMC1_ChIP-seq]
Ontology highlight
ABSTRACT: The multivalent binding of CTCF to a variety of DNA sequences is thought to underlie its ability to mediate a large repertoire of cellular functions. CTCF is a 11 zinc-finger (ZF) protein that is anchored to a majority of its target sites via binding to a 20 base-pair core DNA sequence. Yet the diversity of CTCF binding sites (CBS) has not fully been characterized. Here we assessed CTCF occupancy in cultured mouse cortical neurons as a function of neuronal activity and observed that ~ 22 % of CBS lack the consensus CTCF motif. We report that sequence diversity at most of these atypical CBS is not random but involves degeneracy at specific nucleotide positions within the core CTCF position weight matrix (PWM) that likely affect the binding of ZFs 6 and 7. Surprisingly, degeneracy at the same nucleotides not only define most atypical CBS, but also CBS within most gene promoters, and CBS that are dynamically altered following neuronal stimulation, revealing how atypical CTCF binding could affect gene activity. Dynamic CBS across neural differentiation and neuronal stimulation are found both within and outside loop anchors and TADs, indicating that dynamic CBS could regulate gene activity independently of loop anchoring. Finally, we identified a second mode of atypical CTCF binding that defines most tissue-specific CBS. Unlike other atypical CBS, tissue-specific CBS largely bind sequences unrelated to the core CTCF motif and are enriched within the bodies of tissue-specific genes. Overall, these results indicate how CTCF binding at atypical CBS could allow it to dynamically regulate gene activity patterns during differentiation, development, and in response to environmental cues.
Project description:The multivalent binding of CTCF to a variety of DNA sequences is thought to underlie its ability to mediate a large repertoire of cellular functions. CTCF is a 11 zinc-finger (ZF) protein that is anchored to a majority of its target sites via binding to a 20 base-pair core DNA sequence. Yet the diversity of CTCF binding sites (CBS) has not fully been characterized. Here we assessed CTCF occupancy in cultured mouse cortical neurons as a function of neuronal activity and observed that ~ 22 % of CBS lack the consensus CTCF motif. We report that sequence diversity at most of these atypical CBS is not random but involves degeneracy at specific nucleotide positions within the core CTCF position weight matrix (PWM) that likely affect the binding of ZFs 6 and 7. Surprisingly, degeneracy at the same nucleotides not only define most atypical CBS, but also CBS within most gene promoters, and CBS that are dynamically altered following neuronal stimulation, revealing how atypical CTCF binding could affect gene activity. Dynamic CBS across neural differentiation and neuronal stimulation are found both within and outside loop anchors and TADs, indicating that dynamic CBS could regulate gene activity independently of loop anchoring. Finally, we identified a second mode of atypical CTCF binding that defines most tissue-specific CBS. Unlike other atypical CBS, tissue-specific CBS largely bind sequences unrelated to the core CTCF motif and are enriched within the bodies of tissue-specific genes. Overall, these results indicate how CTCF binding at atypical CBS could allow it to dynamically regulate gene activity patterns during differentiation, development, and in response to environmental cues.
Project description:CTCF, a conserved 3D genome architecture protein, determines proper genome-wide chromatin looping interactions through directional binding to specific sequence elements of four modules within numerous CTCF-binding sites (CBSs) by its 11 zinc fingers (ZFs). Here, we report the crystal structures of four human CTCF-CBS complexes of the protocadherin (Pcdh) clusters, and show that directional CTCF binding to cognate CBSs of the Pcdh enhancers and promoters is achieved through inserting its ZF3, ZFs 4-7, and ZFs 9-11 into the major groove along CBSs, resulting in a sequence-specific recognition of module 4, modules 2-3, and module 1, respectively, with ZF8 as a spacer element for variable distances between modules 1 and 2. In addition, the base-contact with the asymmetric “A” in the central position of modules 2-3, is essential for directional recognition of the CBSs with symmetric core sequences and lack of module 1. Furthermore, CTCF tolerates base changes at specific positions within the degenerated CBS sequences, permitting genome-wide CTCF binding to a diverse range of CBSs. Together, these complex structures provide important insights into the molecular mechanisms for the directionality, diversity, flexibility, dynamics, and conservation of multivalent CTCF binding to its cognate sites across the entire human genome
Project description:CTCF plays an important role in 3D genome organization by adjusting insulation at TAD boundaries, where clustered CBS (CTCF-binding site) elements are often arranged in tandem array with a complex divergent or convergent orientation. Here using cPcdh and HOXD loci as a paradigm, we look into the clustered CTCF TAD boundaries and find that, counterintuitively, outward-oriented CBS elements are crucial for inward enhancer-promoter interactions as well as for gene regulation. Specifically, by combinatorial deletions of a series of putative enhancer elements in vivo or CBS elements in vitro, in conjunction with chromosome conformation capture and RNA-seq analyses, we show that deletions of outward-oriented CBS elements weaken the strength of long-distance intraTAD promoter-enhancer interactions and enhancer activation of target genes. Our data highlight the crucial role of outward-oriented CBS elements within the clustered CTCF TAD boundaries and have interesting implications on the organization principles of clustered CTCF sites within TAD boundaries.
Project description:CTCF plays an important role in 3D genome organization by adjusting insulation at TAD boundaries, where clustered CBS (CTCF-binding site) elements are often arranged in tandem array with a complex divergent or convergent orientation. Here using cPcdh and HOXD loci as a paradigm, we look into the clustered CTCF TAD boundaries and find that, counterintuitively, outward-oriented CBS elements are crucial for inward enhancer-promoter interactions as well as for gene regulation. Specifically, by combinatorial deletions of a series of putative enhancer elements in vivo or CBS elements in vitro, in conjunction with chromosome conformation capture and RNA-seq analyses, we show that deletions of outward-oriented CBS elements weaken the strength of long-distance intraTAD promoter-enhancer interactions and enhancer activation of target genes. Our data highlight the crucial role of outward-oriented CBS elements within the clustered CTCF TAD boundaries and have interesting implications on the organization principles of clustered CTCF sites within TAD boundaries.
Project description:CTCF plays an important role in 3D genome organization by adjusting insulation at TAD boundaries, where clustered CBS (CTCF-binding site) elements are often arranged in tandem array with a complex divergent or convergent orientation. Here using cPcdh and HOXD loci as a paradigm, we look into the clustered CTCF TAD boundaries and find that, counterintuitively, outward-oriented CBS elements are crucial for inward enhancer-promoter interactions as well as for gene regulation. Specifically, by combinatorial deletions of a series of putative enhancer elements in vivo or CBS elements in vitro, in conjunction with chromosome conformation capture and RNA-seq analyses, we show that deletions of outward-oriented CBS elements weaken the strength of long-distance intraTAD promoter-enhancer interactions and enhancer activation of target genes. Our data highlight the crucial role of outward-oriented CBS elements within the clustered CTCF TAD boundaries and have interesting implications on the organization principles of clustered CTCF sites within TAD boundaries.
Project description:CTCF plays an important role in 3D genome organization by adjusting insulation at TAD boundaries, where clustered CBS (CTCF-binding site) elements are often arranged in tandem array with a complex divergent or convergent orientation. Here using cPcdh and HOXD loci as a paradigm, we look into the clustered CTCF TAD boundaries and find that, counterintuitively, outward-oriented CBS elements are crucial for inward enhancer-promoter interactions as well as for gene regulation. Specifically, by combinatorial deletions of a series of putative enhancer elements in vivo or CBS elements in vitro, in conjunction with chromosome conformation capture and RNA-seq analyses, we show that deletions of outward-oriented CBS elements weaken the strength of long-distance intraTAD promoter-enhancer interactions and enhancer activation of target genes. Our data highlight the crucial role of outward-oriented CBS elements within the clustered CTCF TAD boundaries and have interesting implications on the organization principles of clustered CTCF sites within TAD boundaries.
Project description:CTCF plays an important role in 3D genome organization by adjusting insulation at TAD boundaries, where clustered CBS (CTCF-binding site) elements are often arranged in tandem array with a complex divergent or convergent orientation. Here using cPcdh and HOXD loci as a paradigm, we look into the clustered CTCF TAD boundaries and find that, counterintuitively, outward-oriented CBS elements are crucial for inward enhancer-promoter interactions as well as for gene regulation. Specifically, by combinatorial deletions of a series of putative enhancer elements in vivo or CBS elements in vitro, in conjunction with chromosome conformation capture and RNA-seq analyses, we show that deletions of outward-oriented CBS elements weaken the strength of long-distance intraTAD promoter-enhancer interactions and enhancer activation of target genes. Our data highlight the crucial role of outward-oriented CBS elements within the clustered CTCF TAD boundaries and have interesting implications on the organization principles of clustered CTCF sites within TAD boundaries.