Project description:Sequencing was performed to assess the ability of Nanopore direct cDNA and native RNA sequencing to characterise human transcriptomes. Total RNA was extracted from either HAP1 or HEK293 cells, and the polyA+ fraction isolated using oligodT dynabeads. Libraries were prepared using Oxford Nanopore Technologies (ONT) kits according to manufacturers instructions. Samples were then sequenced on ONT R9.4 flow cells to generate fast5 raw reads in the ONT MinKNOW software. Fast5 reads were then base-called using the ONT Albacore software to generate Fastq reads.
Project description:Rapidly increased studies by third-generation sequencing [Pacific Biosciences (Pacbio) and Oxford Nanopore Technologies (ONT)] have been used in all kinds of research areas. Among them, the plant full-length single-molecule transcriptome studies were most used by Pacbio while ONT was rarely used. Therefore, in this study, we developed ONT RNA-sequencing methods in plants. We performed a detailed evaluation of reads from Pacbio and Nanopore PCR cDNA (ONT Pc) sequencing in plants (Arabidopsis), including the characteristics of raw data and identification of transcripts. We aimed to provide a valuable reference for applications of ONT in plant transcriptome analysis.
Project description:We used the nanopore Cas9 targeted sequencing (nCATS) strategy to specifically sequence 125 L1HS-containing loci in parallel and measure their DNA methylation levels using nanopore long-read sequencing. Each targeted locus is sequenced at high coverage (~45X) with unambiguously mapped reads spanning the entire L1 element, as well as its flanking sequences over several kilobases. The genome-wide profile of L1 methylation was also assessed by bs-ATLAS-seq in the same cell lines (E-MTAB-10895).
Project description:BackgroundNanopore sequencing provides a rapid, cheap and portable real-time sequencing platform with the potential to revolutionize genomics. However, several applications are limited by relatively high single-read error rates (>10 %), including RNA-seq, haplotype sequencing and 16S sequencing.ResultsWe developed the Intramolecular-ligated Nanopore Consensus Sequencing (INC-Seq) as a strategy for obtaining long and accurate nanopore reads, starting with low input DNA. Applying INC-Seq for 16S rRNA-based bacterial profiling generated full-length amplicon sequences with a median accuracy >97 %.ConclusionsINC-Seq reads enabled accurate species-level classification, identification of species at 0.1 % abundance and robust quantification of relative abundances, providing a cheap and effective approach for pathogen detection and microbiome profiling on the MinION system.
Project description:We have performed a Proteogenomics meta-analysis of data sets deposited in ProteomeXchange: PXD000265, PXD000313, PXD000923, PXD001030, PXD001058, PXD002291, PXD002739, PXD002740 and PXD003156 and using 29 RNA-Seq data sets on rice (Oryza sativa). We created a search database comprising translated reads that had been mapped onto the rice genome, as well as officially annotated rice proteins sequences. The RNA Seq database was pre-processed to identify “novel transcripts” for those not mapping fully to an existing exon, and “novel junctions” for those reads mapped with a gap, implying a potential novel splice site that was not annotated in the official gene set. Confidentially identified “novel peptides” i.e. those mapping to a novel junction or novel transcript were post-processed to ensure that there were no other better explanations for the corresponding spectra e.g. peptide from a canonical gene with a modification or amino acid substitution. Data were exported from the pipeline in PSI mzIdentML 1.2 format, containing chromosomal coordinates, and further converted to PSI proBed format for genome visualisation. Novel peptides were searched against other plant databases using BLAST to see if they had predicted in genes from other species. A total of 1584 novel peptides were identified, mapping to ~700 genomic loci in which either new genes have been predicted (~100) or updates to existing gene models have been predicted (~600).
Project description:The mouse monoclonal anti-Ro60 antibody (sc-100844, Sants Cruz Biotechnology, Inc) has been validated by coimmunoprecipitation from total Flp-In T-REx 293 cell lysates. An anti-His antibody (SAB1305538, Sigma-Aldrich) was used as a negative control (see experimental details in Jeandard et al., "CoLoC-seq probes the global topology of organelle transcriptomes", manuscript in preparation). Ro60 was found to be the most strongly enriched protein (~23-fold enrichment). Low-level co-enrichment of other proteins (including mitochondria-localised ones) has also been observed. These data validate the specificity of the anti-Ro60 antibody.
Project description:In this study, based on Nanopore direct RNA-seq where native RNAs are sequenced directly as near full-length transcripts in the 3' to 5' direction, transcription units of the phytopathogen Dickeya dadantii 3937 were validated and transcriptional termination sites were determined. Briefly, D. dadantii cultures were grown in M63 medium supplemented with 0.2% glucose and 0.2% PGA, until the early exponential phase (A600nm = 0.2, condition 1), or the early stationary phase (A600nm = 1.8, condition 2). RNAs were extracted using a frozen acid-phenol method, as previously described (Hommais et al. 2008), and treated successively with Roche and Biolabs DNases. Two samples were prepared: 50 µg of RNAs from each condition were pulled into one sample (sample 1), whereas the other one contained 100 µg of RNAs from condition 2 (sample 2). Both samples were then supplied to Vertis Biotechnologie AG for Nanopore native RNA-seq: total RNA preparations were first examined by capillary electrophoresis. For sample 1, ribosomal RNA molecules were depleted using an in-house developed protocol (recovery rate = 84%), whereas no ribodepletion was performed for sample 2. 3' ends of RNA were then poly(A)-tailed using poly(A) polymerase, and the Direct RNA sequencing kit (SQK-RNA002) was used to prepare the library for 1D sequencing on the Oxford Nanopore sequencing device. The direct RNA libraries were sequenced on a MinION device (MIN-101B) using standard settings. Basecalling of the fast5 files was performed using Guppy (version 3.6.1) with the following settings: --flowcell FLO-MIN106 --kit SQK-RNA002 --cpu_threads_per_caller 12--compress_fastq --reverse_sequence true --trim_strategy rna. Reads smaller than 50 nt were removed. 466 393 and 556 850 reads were generated for sample 1 and 2, respectively.
Project description:Probing epigenetic features on long molecules of DNA has tremendous potential to advance our understanding of the phased epigenome. In this study, we evaluate CpG methylation and chromatin accessibility simultaneously on long strands of DNA using GpC methyltransferase to exogenously label open chromatin, coupled with nanopore sequencing technology. We performed nanopore sequencing of Nucleosome Occupancy and Methylome (nanoNOMe) on four human cell lines (GM12878, MCF-10A, MCF-7, MDA-MB-231), and demonstrate the ability to directly measure methylation and chromatin accessibility in genomic features such as structural variations and repetitive elements. The long single-molecule resolution allows footprinting of protein and nucleosome binding and determining the combinatorial promoter epigenetic state on individual molecules. Long-read sequencing makes it possible to robustly assign reads to haplotypes, enabling allele-specific epigenetic analysis across the genome. We use existing SNV data on GM12878 to present the first fully phased human Probing epigenetic features on long molecules of DNA has tremendous potential to advance our understanding of the phased epigenome. We evaluate CpG methylation and chromatin accessibility simultaneously on long strands of DNA using GpC methyltransferase to exogenously label open chromatin, coupled with nanopore sequencing technology. We performed nanopore sequencing of Nucleosome Occupancy and Methylome (nanoNOMe) on four human cell lines (GM12878, MCF-10A, MCF-7, MDA-MB-231), and demonstrate the ability to directly measure methylation and chromatin accessibility in genomic features such as structural variations and repetitive elements. The long single-molecule resolution allows footprinting of protein and nucleosome binding and determining the combinatorial promoter epigenetic state on individual molecules. Long-read sequencing makes it possible to robustly assign reads to haplotypes, enabling allele-specific epigenetic analysis across the genome. We use existing SNV data on GM12878 to present the first fully phased human epigenome, consisting of chromosome-level allele-specific profiles of CpG methylation and chromatin accessibility.mosome-level allele-specific profiles of CpG methylation and chromatin accessibility.