Project description:Control of RNA transcription is critical for the development and homeostasis of all organisms, and can occur at multiple steps of the transcription cycle, including RNA polymerase II (Pol II) recruitment, initiation, promoter-proximal pausing, and elongation. That Pol II accumulates on many promoters in metazoans implies that steps other than Pol II recruitment are rate-limiting and regulated 1-6. By integrating genome-wide Pol II chromatin immunoprecipition (ChIP) and Global Run-On (GRO) genomic data sets from Drosophila cells, we examined critical features of Pol II near promoters. The accumulation of promoter-proximal polymerase is widespread, occurring on 70% of active genes; and unlike elongating Pol II within the body of genes, promoter Pol II are held paused by factors like NELF, unable to transcribe unless nuclei are treated with strong detergent. Notably, we find that the vast majority of promoter-proximal Pol II detected by ChIP are paused, thereby identifying the biochemical nature of this rate-limiting step in transcription. Finally, we demonstrate that Drosophila promoters do not have the upstream divergent Pol II that is seen so broadly and prominently on mammalian promoters. We postulate this is a consequence of Drosophila’s extensive use of directional core promoter sequence elements, which contrasts with mammals’ lack of directional elements and prevalence of CpG island core promoters. In support of this idea, we show that the fraction of mammalian promoters containing a TATA box core element is dramatically depleted of upstream divergent transcription.
Project description:Control of RNA transcription is critical for the development and homeostasis of all organisms, and can occur at multiple steps of the transcription cycle, including RNA polymerase II (Pol II) recruitment, initiation, promoter-proximal pausing, and elongation. That Pol II accumulates on many promoters in metazoans implies that steps other than Pol II recruitment are rate-limiting and regulated 1-6. By integrating genome-wide Pol II chromatin immunoprecipition (ChIP) and Global Run-On (GRO) genomic data sets from Drosophila cells, we examined critical features of Pol II near promoters. The accumulation of promoter-proximal polymerase is widespread, occurring on 70% of active genes; and unlike elongating Pol II within the body of genes, promoter Pol II are held paused by factors like NELF, unable to transcribe unless nuclei are treated with strong detergent. Notably, we find that the vast majority of promoter-proximal Pol II detected by ChIP are paused, thereby identifying the biochemical nature of this rate-limiting step in transcription. Finally, we demonstrate that Drosophila promoters do not have the upstream divergent Pol II that is seen so broadly and prominently on mammalian promoters. We postulate this is a consequence of Drosophila’s extensive use of directional core promoter sequence elements, which contrasts with mammals’ lack of directional elements and prevalence of CpG island core promoters. In support of this idea, we show that the fraction of mammalian promoters containing a TATA box core element is dramatically depleted of upstream divergent transcription. ChIP-seq data set for Pol II (rpb3) (2 replicates).
Project description:Control of RNA transcription is critical for the development and homeostasis of all organisms, and can occur at multiple steps of the transcription cycle, including RNA polymerase II (Pol II) recruitment, initiation, promoter-proximal pausing, and elongation. That Pol II accumulates on many promoters in metazoans implies that steps other than Pol II recruitment are rate-limiting and regulated 1-6. By integrating genome-wide Pol II chromatin immunoprecipition (ChIP) and Global Run-On (GRO) genomic data sets from Drosophila cells, we examined critical features of Pol II near promoters. The accumulation of promoter-proximal polymerase is widespread, occurring on 70% of active genes; and unlike elongating Pol II within the body of genes, promoter Pol II are held paused by factors like NELF, unable to transcribe unless nuclei are treated with strong detergent. Notably, we find that the vast majority of promoter-proximal Pol II detected by ChIP are paused, thereby identifying the biochemical nature of this rate-limiting step in transcription. Finally, we demonstrate that Drosophila promoters do not have the upstream divergent Pol II that is seen so broadly and prominently on mammalian promoters. We postulate this is a consequence of Drosophila’s extensive use of directional core promoter sequence elements, which contrasts with mammals’ lack of directional elements and prevalence of CpG island core promoters. In support of this idea, we show that the fraction of mammalian promoters containing a TATA box core element is dramatically depleted of upstream divergent transcription.
Project description:The human K562 chronic myeloid leukemia cell line has long served as an experimental paradigm for functional genomic studies. To systematically and functionally annotate the human genome, the ENCODE consortium generated hundreds of functional genomic data sets, such as chromatin immunoprecipitation coupled to sequencing (ChIP-seq). While ChIP-seq analyses have provided tremendous insights into gene regulation, spatiotemporal insights were limited by a resolution of several hundred base pairs. ChIP-exonuclease (ChIP-exo) is a refined version of ChIP-seq that overcomes this limitation by providing higher precision mapping of protein-DNA interactions. To study the interplay of transcription initiation and chromatin, we profiled the genome-wide locations for RNA polymerase II (Pol II), the histone variant H2A.Z, and the histone modification H3K4me3 using ChIP-seq and ChIP-exo. In this Data Descriptor, we present detailed information on parallel experimental design, data generation, quality control analysis, and data validation. We discuss how these data lay the foundation for future analysis to understand the relationship between the occupancy of Pol II and nucleosome positions at near base pair resolution.
Project description:While a role of promoter-proximal RNA Polymerase II (Pol II) pausing in regulation of eukaryotic gene expression is implied, the mechanisms and dynamics of this process are poorly understood. We performed genome-wide analysis of short capped RNAs (scRNAs) and Pol II chromatin immunoprecipitation sequencing (ChIP-seq) in human breast cancer MCF-7 cells to better understand Pol II pausing (Samarakkody, A., Abbas, A., Scheidegger, A., Warns, J., Nnoli, O., Jokinen, B., Zarns, K., Kubat, B., Dhasarathy, A. and Nechaev, S. (2015) RNA polymerase II pausing can be retained or acquired during activation of genes involved in the epithelial to mesenchymal transition. Nucleic Acids Res43, 3938-3949). The data are available at the NCBI Gene Expression Omnibus under accession number GSE67041. For both ChIP and scRNA samples, we used paired end sequencing on the Illumina MiSeq instrument. For ChIP-seq, the use of paired end sequencing allowed us to avoid ambiguities in center-read definition. For scRNA seq, this allowed us to identify both the 5'-end and the 3'-end in the same run that represent, respectively, the transcription start sites and the locations of Pol II pausing. The sharpening of Pol II ChIP-seqmetagene profiles when aligned against 5'-ends of scRNAs indicates that these RNAs can be used to define the start sites for the majority of mRNA transcription events.
Project description:Copper homoeostasis was investigated in the Drosophila melanogaster S2 cell line to develop an insect model for the study of copper regulation. Real-time PCR studies have demonstrated expression in S2 cells of putative orthologues of human Cu regulatory genes involved in the uptake, transport, sequestration and efflux of Cu. Drosophila orthologues of the mammalian Cu chaperones, ATOX1 (a human orthologue of yeast ATX1), CCS (copper chaperone for superoxide dismutase), COX17 (a human orthologue of yeast COX17), and SCO1 and SCO2, did not significantly respond transcriptionally to increased Cu levels, whereas MtnA, MtnB and MtnD (Drosophila orthologues of human metallothioneins) were up-regulated by Cu in a time- and dose-dependent manner. To examine the effect on Cu homoeostasis, expression of several key copper homoeostasis genes was suppressed using double-stranded RNA interference. Suppression of the MTF-1 (metal-regulatory transcription factor 1), reduced both basal and Cu-induced gene expressions of MtnA, MtnB and MtnD, significantly reducing the tolerance of these cells to increased Cu. Suppression of either Ctr1A (a Drosophila orthologue of yeast CTR1) or Ctr1B significantly reduced Cu uptake from media, demonstrating that both these proteins function to transport Cu into S2 cells. Significantly, Cu induced Ctr1B gene expression, and this could be prevented by suppressing MTF-1, suggesting that Ctr1B might be involved in Cu detoxification. Suppression of DmATP7, the putative homologue of human Cu transporter genes ATP7A and ATP7B, significantly increased Cu accumulation, demonstrating that DmATP7 is essential for efflux of excess Cu. This work is consistent with previous studies in mammalian cells, validating S2 cells as a model system for studying Cu transport and identifying novel Cu regulatory mechanisms.
Project description:Recent advances in sequencing technology have opened a new era in RNA studies. Novel types of RNAs such as long non-coding RNAs (lncRNAs) have been discovered by transcriptomic sequencing and some lncRNAs have been found to play essential roles in biological processes. However, only limited information is available for lncRNAs in Drosophila melanogaster, an important model organism. Therefore, the characterization of lncRNAs and identification of new lncRNAs in D. melanogaster is an important area of research. Moreover, there is an increasing interest in the use of ChIP-seq data (H3K4me3, H3K36me3 and Pol II) to detect signatures of active transcription for reported lncRNAs.We have developed a computational approach to identify new lncRNAs from two tissue-specific RNA-seq datasets using the poly(A)-enriched and the ribo-zero method, respectively. In our results, we identified 462 novel lncRNA transcripts, which we combined with 4137 previously published lncRNA transcripts into a curated dataset. We then utilized 61 RNA-seq and 32 ChIP-seq datasets to improve the annotation of the curated lncRNAs with regards to transcriptional direction, exon regions, classification, expression in the brain, possession of a poly(A) tail, and presence of conventional chromatin signatures. Furthermore, we used 30 time-course RNA-seq datasets and 32 ChIP-seq datasets to investigate whether the lncRNAs reported by RNA-seq have active transcription signatures. The results showed that more than half of the reported lncRNAs did not have chromatin signatures related to active transcription. To clarify this issue, we conducted RT-qPCR experiments and found that ~95.24% of the selected lncRNAs were truly transcribed, regardless of whether they were associated with active chromatin signatures or not.In this study, we discovered a large number of novel lncRNAs, which suggests that many remain to be identified in D. melanogaster. For the lncRNAs that are known, we improved their characterization by integrating a large number of sequencing datasets (93 sets in total) from multiple sources (lncRNAs, RNA-seq and ChIP-seq). The RT-qPCR experiments demonstrated that RNA-seq is a reliable platform to discover lncRNAs. This set of curated lncRNAs with improved annotations can serve as an important resource for investigating the function of lncRNAs in D. melanogaster.
Project description:This chapter presents methods to conduct and analyze genome-wide chromatin immunoprecipitation of the cohesin complex and the Nipped-B cohesin loading factor in Drosophila cells using high-throughput DNA sequencing (ChIP-seq). Procedures for isolation of chromatin, immunoprecipitation, and construction of sequencing libraries for the Ion Torrent Proton high throughput sequencer are detailed, and computational methods to calculate occupancy as input-normalized fold-enrichment are described. The results obtained by ChIP-seq are compared to those obtained by ChIP-chip (genomic ChIP using tiling microarrays), and the effects of sequencing depth on the accuracy are analyzed. ChIP-seq provides similar sensitivity and reproducibility as ChIP-chip, and identifies the same broad regions of occupancy. The locations of enrichment peaks, however, can differ between ChIP-chip and ChIP-seq, and low sequencing depth can splinter broad regions of occupancy into distinct peaks.
Project description:ChIP-seq study analysing adult Drosophila melanogaster head, glial, neuronal and fat body, as well as embryonic RNA pol II and H2A.v binding by employing the GAL4-UAS system to generate GFP-fusion proteins and ChIP-seq