Project description:DNA sequence and local chromatin landscape act jointly to determine transcription factor (TF) binding intensity profiles. To disentangle these influences, we developed an experimental approach, called protein/DNA binding and high-throughput sequencing (PB-seq), that allows the binding energy landscape to be characterized genome-wide in the absence of chromatin. We applied our methods to the Drosophila Heat Shock Factor (HSF), which inducibly binds a target DNA sequence element (HSE) following heat shock stress. PB-seq involves incubating sheared naked genomic DNA with recombinant HSF, partitioning the HSF-bound and HSF-free DNA, and then detecting HSF-bound DNA by high throughput sequencing. We compared PB-seq binding profiles with ones observed in vivo by ChIP-seq, and developed statistical models to predict the observed departures from idealized binding patterns based on covariates describing the local chromatin environment. We found that DNase I hypersensitivity and tetra-acetylation of H4 were the most influential covariates in predicting changes in HSF binding affinity. We also investigated the extent to which DNA accessibility, as measured by digital DNase I footprinting data, could be predicted from MNase-seq data and the ChIP-chip profiles for many histone modifications and TFs, and found GAGA element associated factor (GAF), tetra-acetylation of H4, and H4K16 acetylation to be the most predictive covariates. Lastly, we generated an unbiased model of HSF binding sequences, which revealed distinct biophysical properties of the HSF/HSE interaction and a previously unrecognized substructure within the HSE. These findings provide new insights into the interplay between the genomic sequence and the chromatin landscape in determining transcription factor binding intensity. We performed an in vitro binding experiment with purified HSF and naked, sheared genomic Drosophila S2 DNA (PB-seq), to derive an accurate set of potential HSF binding sites in the Drosophila genome. HSF-bound DNA was specifically eluted and detected by high throughput sequencing. Drosophila HSF was N-terminally tagged with glutathione s-transferase and a tobacco etch virus (TEV) protease cleavage site. The C-terminus of the recombinant HSF was fused to the 3xFLAG epitope. Recombinant HSF was purified from E. coli with glutathione resin as previously described (PMID: 20078429) , with the following modifications: HSF-3xFLAG elution was achieved by addition of 6xHistidine tagged TEV protease and TEV protease was cleared from the HSF preparation using a Nickel-NTA column. We incubated 600pM HSF and 2500ng genomic DNA (sonicated to 100-600bp fragment size as previously described in PMID: 20844575) in 1500μl final volume of 1xHSF binding buffer and let it come to equilibrium for an hour at room temperature. We added 20μl ANTI-FLAG M2 affinity gel for 10 minutes, washed 8 times with 1xHSF binding buffer to remove unbound DNA; 3xFLAG peptide was added to a final concentration of 200ng/μl to specifically elute HSF and HSF-bound DNA. The mock IP was done in the absence of recombinant HSF.
Project description:Sequence-specific transcription factors (TFs) are critical for specifying patterns and levels of gene expression, but the DNA elements to which they can bind are not always sufficient to specify their binding in vivo. In eukaryotes, the binding of a TF is in competition with a constellation of other proteins, including histones which package DNA into nucleosomes. Here, we examine using the ChIP-seq assay, the genome-wide distribution of Drosophila Heat Shock Factor (HSF), a TF whose binding activity is mediated by heat shock-induced trimerization. We detect HSF binding to 464 sites, the vast majority of which contain HSF Sequence-binding Elements (HSEs) in Drosophila S2 cells, but these HSF-bound sites represent only a small fraction of HSEs present in the genome. We find a strong correlation of bound HSEs to active chromatin marks present prior to HSF binding, indicating an HSEM-bM-^@M-^Ys residence in open chromatin is a primary determinant of whether HSF can bind following heat shock. A single mock immunoprecipitation (IP) using the pre-innoculated animal serum was used as a background dataset for this study. For each condition (NHS and 20minute HS), we performed two independent HSF-ChIP-seq experiments. In addition, we performed two independent HSF-ChIP-seq experiments for each condition (NHS and 20minute HS) in cells that were highly depleted of HSF by RNAi.