Project description:5-hydroxymethylcytosine (5-hmC), a derivative of 5-methylcytosine (5-mC), is abundant in the brain for unknown reasons. We mapped the genomic distribution of 5-hmC and 5-mC in human and mouse tissues using glucosylation of 5-hmC coupled with restriction enzyme digestion, and interrogation on microarrays. We detected 5-hmC enrichment in genes with synapse-related functions in the brain. We also identified significant, tissue-specific differential distributions of these DNA modifications at the exon-intron boundary, in both human and mouse. This boundary change was mainly due to 5-hmC in the brain, but due to 5-mC in non-neural contexts. This pattern was replicated in multiple independent datasets, and the brain-specific change in 5-hmC was validated using single-molecule sequencing. Moreover, in the brain, constitutive exons contained higher levels of 5-hmC, relative to alternatively-spliced exons. Our study suggests a novel role for 5-hmC in RNA splicing and synaptic function in the brain
Project description:5-hydroxymethylcytosine (5-hmC), a derivative of 5-methylcytosine (5-mC), is abundant in the brain for unknown reasons. We mapped the genomic distribution of 5-hmC and 5-mC in human and mouse tissues using glucosylation of 5-hmC coupled with restriction enzyme digestion, and interrogation on microarrays. We detected 5-hmC enrichment in genes with synapse-related functions in the brain. We also identified significant, tissue-specific differential distributions of these DNA modifications at the exon-intron boundary, in both human and mouse. This boundary change was mainly due to 5-hmC in the brain, but due to 5-mC in non-neural contexts. This pattern was replicated in multiple independent datasets, and the brain-specific change in 5-hmC was validated using single-molecule sequencing. Moreover, in the brain, constitutive exons contained higher levels of 5-hmC, relative to alternatively-spliced exons. Our study suggests a novel role for 5-hmC in RNA splicing and synaptic function in the brain
Project description:5-hydroxymethylcytosine (5-hmC), a derivative of 5-methylcytosine (5-mC), is abundant in the brain for unknown reasons. We mapped the genomic distribution of 5-hmC and 5-mC in human and mouse tissues using glucosylation of 5-hmC coupled with restriction enzyme digestion, and interrogation on microarrays. We detected 5-hmC enrichment in genes with synapse-related functions in the brain. We also identified significant, tissue-specific differential distributions of these DNA modifications at the exon-intron boundary, in both human and mouse. This boundary change was mainly due to 5-hmC in the brain, but due to 5-mC in non-neural contexts. This pattern was replicated in multiple independent datasets, and the brain-specific change in 5-hmC was validated using single-molecule sequencing. Moreover, in the brain, constitutive exons contained higher levels of 5-hmC, relative to alternatively-spliced exons. Our study suggests a novel role for 5-hmC in RNA splicing and synaptic function in the brain The M-NM-2-glucosyltransferase (BGT) enzyme transfers a glucose molecule specifically to the hydroxymethyl group of 5-hmC, thus rendering it resistant to digestion by the methylation insensitive MspI enzyme at the ChmCGG target site; 5-hmC is thus detected by differential resistance to MspI-digestion with and without glucosylation of genomic DNA (gDNA). HpaII (targets the same site, CCGG) cannot cut CmCGG or ChmCGG, and conceptually its difference with MspI digestion is a measure of both 5-mC and 5-hmC. Subtraction of 5-hmC from the HpaII-based estimate therefore measures 5-mC. These estimates were measured on respective Affymetrix whole-genome tiling arrays (2.0 R ) MspI - APRIL_fc1.ch02_coverage.bed Undigested DNA - APRIL_fc1.ch01_coverage.bed GluMspI - APRIL_fc1.ch04_coverage_NONTARGET.bed GluMspI - APRIL_fc1.ch04_coverage.bed MspI - APRIL_fc1.ch02_coverage_NONTARGET.bed Undigested DNA - APRIL_fc1.ch01_coverage_NONTARGET.bed MspI - MARCH_fc1.ch02_coverage_NONTARGET.bed Undigested DNA - MARCH_fc1.ch01_coverage_NONTARGET.bed GluMspI - MARCH_fc1.ch04_coverage_NONTARGET.bed GluMspI - MARCH_fc1.ch04_coverage.bed MspI - MARCH_fc1.ch02_coverage.bed Undigested DNA - MARCH_fc1.ch01_coverage.bed MspI - MAY_fc1.ch02_coverage.bed GluMspI - MAY_fc1.ch04_coverage_NONTARGET.bed GluMspI - MAY_fc1.ch04_coverage.bed Undigested DNA - MAY_fc1.ch01_coverage.bed MspI - MAY_fc1.ch02_coverage_NONTARGET.bed Undigested DNA - MAY_fc1.ch01_coverage_NONTARGET.bed
Project description:The 5-methylcytosine (5-mC) derivative 5-hydroxymethylcytosine (5-hmC) is abundant in the brain for unknown reasons. Here we characterize the genomic distribution of 5-hmC and 5-mC in human and mouse tissues. We assayed 5-hmC by using glucosylation coupled with restriction-enzyme digestion and microarray analysis. We detected 5-hmC enrichment in genes with synapse-related functions in both human and mouse brain. We also identified substantial tissue-specific differential distributions of these DNA modifications at the exon-intron boundary in human and mouse. This boundary change was mainly due to 5-hmC in the brain but due to 5-mC in non-neural contexts. This pattern was replicated in multiple independent data sets and with single-molecule sequencing. Moreover, in human frontal cortex, constitutive exons contained higher levels of 5-hmC relative to alternatively spliced exons. Our study suggests a new role for 5-hmC in RNA splicing and synaptic function in the brain.
Project description:Precise identification of correct exon-intron boundaries is a prerequisite to analyze the location and structure of genes. The existing framework for genomic signals, delineating exon and introns in a genomic segment, seems insufficient, predominantly due to poor sequence consensus as well as limitations of training on available experimental data sets. We present here a novel concept for characterizing exon-intron boundaries in genomic segments on the basis of structural and energetic properties. We analyzed boundary junctions on both sides of all the exons (3 28 368) of protein coding genes from human genome (GENCODE database) using 28 structural and three energy parameters. Study of sequence conservation at these sites shows very poor consensus. It is observed that DNA adopts a unique structural and energy state at the boundary junctions. Also, signals are somewhat different for housekeeping and tissue specific genes. Clustering of 31 parameters into four derived vectors gives some additional insights into the physical mechanisms involved in this biological process. Sites of structural and energy signals correlate well to the positions playing important roles in pre-mRNA splicing.
Project description:BACKGROUND:In-depth study of the intron retention levels of transcripts provide insights on the mechanisms regulating pre-mRNA splicing efficiency. Additionally, detailed analysis of retained introns can link these introns to post-transcriptional regulation or identify aberrant splicing events in human diseases. RESULTS:We present IntEREst, Intron-Exon Retention Estimator, an R package that supports rigorous analysis of non-annotated intron retention events (in addition to the ones annotated by RefSeq or similar databases), and support intra-sample in addition to inter-sample comparisons. It accepts binary sequence alignment/map (.bam) files as input and determines genome-wide estimates of intron retention or exon-exon junction levels. Moreover, it includes functions for comparing subsets of user-defined introns (e.g. U12-type vs U2-type) and its plotting functions allow visualization of the distribution of the retention levels of the introns. Statistical methods are adapted from the DESeq2, edgeR and DEXSeq R packages to extract the significantly more or less retained introns. Analyses can be performed either sequentially (on single core) or in parallel (on multiple cores). We used IntEREst to investigate the U12- and U2-type intron retention in human and plant RNAseq dataset with defects in the U12-dependent spliceosome due to mutations in the ZRSR2 component of this spliceosome. Additionally, we compared the retained introns discovered by IntEREst with that of other methods and studies. CONCLUSION:IntEREst is an R package for Intron retention and exon-exon junction levels analysis of RNA-seq data. Both the human and plant analyses show that the U12-type introns are retained at higher level compared to the U2-type introns already in the control samples, but the retention is exacerbated in patient or plant samples carrying a mutated ZRSR2 gene. Intron retention events caused by ZRSR2 mutation that we discovered using IntEREst (DESeq2 based function) show considerable overlap with the retained introns discovered by other methods (e.g. IRFinder and edgeR based function of IntEREst). Our results indicate that increase in both the number of biological replicates and the depth of sequencing library promote the discovery of retained introns, but the effect of library size gradually decreases with more than 35 million reads mapped to the introns.
Project description:The Exon/Intron (ExInt) database incorporates information on the exon/intron structure of eukaryotic genes. Features in the database include: intron nucleotide sequence, amino acid sequence of the corresponding protein, position of the introns at the amino acid level and intron phase. From ExInt, we have also generated four additional databases each with ExInt entries containing predicted introns, introns experimentally defined, organelle introns or nuclear introns. ExInt is accessible through a retrieval system with pointers to GenBank. The database can be searched by keywords, locus name, NID, accession number or length of the protein. ExInt is freely accessible at http://intron.bic.nus.edu.sg/exint/exint.html
Project description:Pathogenic variants in KCNQ2 encoding for Kv7.2 potassium channel subunits have been found in patients affected by widely diverging epileptic phenotypes, ranging from Self-Limiting Familial Neonatal Epilepsy (SLFNE) to severe Developmental and Epileptic Encephalopathy (DEE). Thus, understanding the pathogenic molecular mechanisms of KCNQ2 variants and their correlation with clinical phenotypes has a relevant impact on the clinical management of these patients. In the present study, the genetic, biochemical, and functional effects prompted by two variants, each found in a non-familial SLNE or a DEE patient but both affecting nucleotides at the KCNQ2 intron 6-exon 7 boundary, have been investigated to test whether and how they affected the splicing process and to clarify whether such mechanism might play a pathogenetic role in these patients. Analysis of KCNQ2 mRNA splicing in patient-derived lymphoblasts revealed that the SLNE-causing intronic variant (c.928-1G > C) impeded the use of the natural splice site, but lead to a 10-aa Kv7.2 in frame deletion (Kv7.2 p.G310Δ10); by contrast, the DEE-causing exonic variant (c.928G > A) only had subtle effects on the splicing process at this site, thus leading to the synthesis of a full-length subunit carrying the G310S missense variant (Kv7.2 p.G310S). Patch-clamp recordings in transiently-transfected CHO cells and primary neurons revealed that both variants fully impeded Kv7.2 channel function, and exerted strong dominant-negative effects when co-expressed with Kv7.2 and/or Kv7.3 subunits. Notably, Kv7.2 p.G310S, but not Kv7.2 p.G310Δ10, currents were recovered upon overexpression of the PIP2-synthesizing enzyme PIP5K, and/or CaM; moreover, currents from heteromeric Kv7.2/Kv7.3 channels incorporating either Kv7.2 mutant subunits were differentially regulated by changes in PIP2 availability, with Kv7.2/Kv7.2 G310S/Kv7.3 currents showing a greater sensitivity to PIP2 depletion when compared to those from Kv7.2/Kv7.2 G310Δ10/Kv7.3 channels. Altogether, these results suggest that the two variants investigated differentially affected the splicing process at the intron 6-exon 7 boundary, and led to the synthesis of Kv7.2 subunits showing a differential sensitivity to PIP2 and CaM regulation; more studies are needed to clarify how such different functional properties contribute to the widely-divergent clinical phenotypes.