Project description:We used SLIC-CAGE to map transcription start sites (TSSs) of mouse primordial germ cells from embryonic days 9.5-16.5, postnatal oocytes (P6, P14 and MII), and early 2-cell and 4-cell mouse embryos. We use this TSS data to show that the mouse germline development starts with the somatic promoter code with a prominent switch to the maternal code (W-box dependent) occurring during the follicular oogenesis. We also find that the promoters of gonadal germ cells are characterised by a previously unknown divergence from the somatic transcription initiation. This divergence is distinct from the promoter code used later by the developing oocytes and reveals genome-wide promoter remodelling during early female and male germline development.
Project description:We used SLIC-CAGE to map transcriptional start sites (TSSs) of P14 WT and P14 Tbpl2-/- mutant mouse oocytes. Comparison of WT and Tbpl2-/- oocytes demonstrates that Tbpl2 guides transcriptional start site selection in the growing oocyte. This TSS selection is different compared to the canonical somatic type of TSS selection and depends on TATA-like elements as core promoter motifs, recognised by Tbpl2.
Project description:An increasing number of non-coding RNAs (ncRNAs) are implicated in various human diseases including cancer; however ncRNA transcriptome of hepatocellular carcinoma (HCC) remains largely unexplored. We use CAGE (Cap Analysis of Gene Expression) to comprehensively map transcription start sites (TSSs) across different etiologies of human HCC as well as mouse HCC, with particular emphasis on ncRNAs distant from protein-coding genes. We find thousands of significantly up-regulated distal ncRNAs in HCC tumors compared to their matched non-tumors, which are as many as protein-coding genes. Moreover, we identify many LTR retroviral promoters activated in HCC tissues and expressed in a subfamily-specific manner, which account for approximately 20% of the up-regulated distal ncRNAs. The transcripts derived from LTRs, determined by 3' RACE, are multi-exon nuclear ncRNAs typically 0.5-2kb in length. This study sheds light on ncRNA transcriptome of human and mouse HCC. Expression profiles using CAGE for 37 mouse HCC. The human data are archived at dbGaP (phs000885.v1.p1). An umbrella BioProject has been created to associate the GEO and dbGaP BioProjects: PRJNA278792
Project description:An increasing number of non-coding RNAs (ncRNAs) are implicated in various human diseases including cancer; however ncRNA transcriptome of hepatocellular carcinoma (HCC) remains largely unexplored. We use CAGE (Cap Analysis of Gene Expression) to comprehensively map transcription start sites (TSSs) across different etiologies of human HCC as well as mouse HCC, with particular emphasis on ncRNAs distant from protein-coding genes. We find thousands of significantly up-regulated distal ncRNAs in HCC tumors compared to their matched non-tumors, which are as many as protein-coding genes. Moreover, we identify many LTR retroviral promoters activated in HCC tissues and expressed in a subfamily-specific manner, which account for approximately 20% of the up-regulated distal ncRNAs. The transcripts derived from LTRs, determined by 3' RACE, are multi-exon nuclear ncRNAs typically 0.5-2kb in length. This study sheds light on ncRNA transcriptome of human and mouse HCC.
Project description:The mammalian sex-determining gene Sry induces male development. Through analyzing transcriptome of pre-Sertoli cells, we identified a novel sequence transcribed within palindromic sequence surrounding Sry. To examine whether this sequence contains transcription start site, Cap analysis gene expression (CAGE) -seq analysis was performed, in which putative transcriptional start sites can be identified by sequencing the 3’ end of cDNA corresponding to 5’ end of RNA. CAGE seq revealed that Cap site was absent within the novel sequence, whereas Sry transcription start site found to locate in minus strand at chrY:2,663,800-2,663,900. Combining the result from long-read RNA seq, we concluded that this sequence is the second exon of mouse Sry.
Project description:We collected sample-matched multiomics data including RNA-seq, transcription start site sequencing (TSS-seq), termination sequencing (term-seq), ribosome profiling, and label-free shotgun proteomic mass spectrometry across different growth conditions to improve annotation and assign functional sites in the Zymomonas mobilis subsp. mobilils ZM4 genome. Proteomics and ribosome profiling informed revisions of protein-coding genes, which included 44 start codon changes and 42 added proteins. We developed statistical methods for annotating transcript 5′ and 3′ ends enabling identification of 3940 TSSs and their corresponding promoters and 2091 transcription termination sites, which were distinguished from RNA processing sites by lack of an adjacent RNA 5′ end.
Project description:The CAGE experiment was performed in four different chemostat conditions to assess if and how much the Transcription start site landscape changes in the industrial relevant S. Cerevisiae strain CEN.PK113-7D in different conditions. CAGE was also used to obtain accurate TSS annotations for each expressed gene.
Project description:We develop a method Re-Cappable-seq for determining eukaryotic transcription start sites derived from all RNA polymerases at nucleotide resolution. In particular, this method identifies the Pol-I and Pol-III TSSs, which are missing by CAGE. Applied to human A549 cell line, our method results in the identification of 33,468 and 5,269 high confidence Pol-II and non-Pol-II TSS respectively. Re-Cappable-seq identifies Pol-II TSS with higher specificity than CAGE.
Project description:We performed CAGE-Seq on dissected ovaries and testes as well as female and male carcasses of two species of Drosophila (D. melanogaster and D. pseudoobscura). These data are used to map transcription start sites. For data usage terms and conditions, please refer to http://www.genome.gov/27528022 and http://www.genome.gov/Pages/Research/ENCODE/ENCODEDataReleasePolicyFinal2008.pdf