Project description:The field of epitranscriptomics has undergone an enormous expansion in the last few years; however, a major limitation is the lack of generic methods to map RNA modifications transcriptome-wide. Here we show that using Oxford Nanopore Technologies, N6-methyladenosine (m6A) RNA modifications can be detected with high accuracy, in the form of systematic errors and decreased base-calling qualities. Our results open new avenues to investigate the universe of RNA modifications with single nucleotide resolution, in individual RNA molecules.
Project description:The field of epitranscriptomics has experienced a major revolution in the last few years; however, a major limitation is the lack of generic methods to map RNA modifications transcriptome-wide. Here we show that using the platform offered by Oxford Nanopore Technologies, we can detect N6-methyladenosine (m6A) RNA modifications with high accuracy. Our results open new avenues to investigate the universe of RNA modifications with single nucleotide, single molecule resolution.
Project description:The epitranscriptomics field has undergone an enormous expansion in the last few years; however, a major limitation is the lack of generic methods to map RNA modifications transcriptome-wide. Here, we show that using direct RNA sequencing, N6-methyladenosine (m6A) RNA modifications can be detected with high accuracy, in the form of systematic errors and decreased base-calling qualities. Specifically, we find that our algorithm, trained with m6A-modified and unmodified synthetic sequences, can predict m6A RNA modifications with ~90% accuracy. We then extend our findings to yeast data sets, finding that our method can identify m6A RNA modifications in vivo with an accuracy of 87%. Moreover, we further validate our method by showing that these 'errors' are typically not observed in yeast ime4-knockout strains, which lack m6A modifications. Our results open avenues to investigate the biological roles of RNA modifications in their native RNA context.
Project description:N6-methyladenosine (m6A) is the most abundant internal RNA modification in eukaryotic mRNAs and influences many aspects of RNA processing. miCLIP (m6A individual-nucleotide resolution UV crosslinking and immunoprecipitation) is an antibody-based approach to map m6A sites with single-nucleotide resolution. However, due to broad antibody reactivity, reliable identification of m6A sites from miCLIP data remains challenging. Here, we present miCLIP2 in combination with machine learning to significantly improve m6A detection. The optimized miCLIP2 results in high-complexity libraries from less input material. Importantly, we established a robust computational pipeline to tackle the inherent issue of false positives in antibody-based m6A detection. The analyses were calibrated with Mettl3 knockout cells to learn the characteristics of m6A deposition, including m6A sites outside of DRACH motifs. To make our results universally applicable, we trained a machine learning model, m6Aboost, based on the experimental and RNA sequence features. Importantly, m6Aboost allows prediction of genuine m6A sites in miCLIP2 data without filtering for DRACH motifs or the need for Mettl3 depletion. Using m6Aboost, we identify thousands of high-confidence m6A sites in different murine and human cell lines, which provide a rich resource for future analysis. Collectively, our combined experimental and computational methodology greatly improves m6A identification.
Project description:Bacterial DNA is subject to various modifications involved in gene regulation and defense against bacteriophage attacks. Phosphorothioate (PT) modifications are protective modifications in which the non-bridging oxygen in the DNA phosphate backbone is replaced with a sulfur atom. Here, we expand third-generation sequencing techniques to allow for the sequence-specific mapping of DNA modifications by demonstrating the application of Oxford Nanopore Technologies (ONT) and the ELIGOS software package for site-specific detection and characterization of PT modifications. The ONT/ELIGOS platform accurately detected PT modifications in a plasmid carrying synthetic PT modifications. Subsequently, studies were extended to the genome-wide mapping of PT modifications in the Salmonella enterica genomes within the wild-type strain and strains lacking the PT regulatory gene dndB (ΔdndB) or the PT synthetic gene dndC (ΔdndC). PT site-specific signatures were observed in the established motifs of GAAC/GTTC. The PT site locations were in close agreement with PT sites previously identified using the Nick-seq technique. Compared to the wild-type strain, the number of PT modifications are 1.8-fold higher in ΔdndB and 25-fold lower in ΔdndC, again consistent with known regulation of the dnd operon. These results demonstrate the suitability of the ONT platform for accurate detection and identification of the unusual PT backbone modifications in native genome sequences.
Project description:N6-methyladenosine (m6A) is the most abundant internal RNA modification in eukaryotic mRNAs and influences many aspects of RNA processing, such as RNA stability and translation. miCLIP (m6A individual-nucleotide resolution UV crosslinking and immunoprecipitation) is an antibody-based approach to map m6A sites in the transcriptome with single-nucleotide resolution. However, due to broad antibody reactivity, reliable identification of m6A sites from miCLIP data remains challenging. Here, we present several experimental and computational innovations that significantly improve transcriptome-wide detection of m6A sites. Based on the recently developed iCLIP2 protocol, the optimised miCLIP2 results in high-complexity libraries using less input material, leading to a more comprehensive representation of m6A sites. Next, we established a robust computational pipeline to identify true m6A sites from our miCLIP2 data. The analyses are calibrated with data from Mettl3 knockout cells to learn the characteristics of m6A deposition, including a significant number of m6A sites outside of DRACH motifs. In order to make these results universally applicable, we trained a machine learning model, m6Aboost, based on the experimental and RNA sequence features. Importantly, m6Aboost allows prediction of genuine m6A sites in miCLIP data without filtering for DRACH motifs or the need for Mettl3 depletion. Using m6Aboost, we identify thousands of high-confidence m6A sites in different murine and human cell lines, which provide a rich resource for future analysis. Collectively, our combined experimental and computational methodology greatly improves m6A identification.
Project description:N6-methyladenosine (m6A) is the most abundant internal RNA modification in eukaryotic mRNAs and influences many aspects of RNA processing, such as RNA stability and translation. miCLIP (m6A individual-nucleotide resolution UV crosslinking and immunoprecipitation) is an antibody-based approach to map m6A sites in the transcriptome with single-nucleotide resolution. However, due to broad antibody reactivity, reliable identification of m6A sites from miCLIP data remains challenging. Here, we present several experimental and computational innovations that significantly improve transcriptome-wide detection of m6A sites. Based on the recently developed iCLIP2 protocol, the optimised miCLIP2 results in high-complexity libraries using less input material, leading to a more comprehensive representation of m6A sites. Next, we established a robust computational pipeline to identify true m6A sites from our miCLIP2 data. The analyses are calibrated with data from Mettl3 knockout cells to learn the characteristics of m6A deposition, including a significant number of m6A sites outside of DRACH motifs. In order to make these results universally applicable, we trained a machine learning model, m6Aboost, based on the experimental and RNA sequence features. Importantly, m6Aboost allows prediction of genuine m6A sites in miCLIP data without filtering for DRACH motifs or the need for Mettl3 depletion. Using m6Aboost, we identify thousands of high-confidence m6A sites in different murine and human cell lines, which provide a rich resource for future analysis. Collectively, our combined experimental and computational methodology greatly improves m6A identification.
Project description:N6-methyladenosine (m6A) is the most abundant internal RNA modification in eukaryotic mRNAs and influences many aspects of RNA processing, such as RNA stability and translation. miCLIP (m6A individual-nucleotide resolution UV crosslinking and immunoprecipitation) is an antibody-based approach to map m6A sites in the transcriptome with single-nucleotide resolution. However, due to broad antibody reactivity, reliable identification of m6A sites from miCLIP data remains challenging. Here, we present several experimental and computational innovations that significantly improve transcriptome-wide detection of m6A sites. Based on the recently developed iCLIP2 protocol, the optimised miCLIP2 results in high-complexity libraries using less input material, leading to a more comprehensive representation of m6A sites. Next, we established a robust computational pipeline to identify true m6A sites from our miCLIP2 data. The analyses are calibrated with data from Mettl3 knockout cells to learn the characteristics of m6A deposition, including a significant number of m6A sites outside of DRACH motifs. In order to make these results universally applicable, we trained a machine learning model, m6Aboost, based on the experimental and RNA sequence features. Importantly, m6Aboost allows prediction of genuine m6A sites in miCLIP data without filtering for DRACH motifs or the need for Mettl3 depletion. Using m6Aboost, we identify thousands of high-confidence m6A sites in different murine and human cell lines, which provide a rich resource for future analysis. Collectively, our combined experimental and computational methodology greatly improves m6A identification.