Project description:We identified hankyphage prophages within B. thetaiotaomicron isolates gathered from French hospitals. We extracted genomic DNA from an overnight culture from a single colony of each strain and sequenced them using Nanopore sequencing using the Plasmidsaurus platform. This long-read approach helped the assembly of the phages and determination of the hankyphage ends. We also improved the annotation of the reference hankyphage (hankyphage p00 from P. dorei HM719) using a structural prediction approach and annotated our B. thetaiotaomicron hankyphages using this new annotation. In this project we upload the genomic raw reads of nanopore sequencing of our hankyphage-bearing B. thetaiotaomicron collection (jmh strains) and the processed assembled hankyphages.
Project description:Nitrate-reducing iron(II)-oxidizing bacteria are widespread in the environment contribute to nitrate removal and influence the fate of the greenhouse gases nitrous oxide and carbon dioxide. The autotrophic growth of nitrate-reducing iron(II)-oxidizing bacteria is rarely investigated and poorly understood. The most prominent model system for this type of studies is enrichment culture KS, which originates from a freshwater sediment in Bremen, Germany. To gain insights in the metabolism of nitrate reduction coupled to iron(II) oxidation under in the absence of organic carbon and oxygen limited conditions, we performed metagenomic, metatranscriptomic and metaproteomic analyses of culture KS. Raw sequencing data of 16S rRNA amplicon sequencing, shotgun metagenomics (short reads: Illumina; long reads: Oxford Nanopore Technologies), metagenome assembly, raw sequencing data of shotgun metatranscriptomes (2 conditions, triplicates) can be found at SRA in https://www.ncbi.nlm.nih.gov/bioproject/PRJNA682552. This dataset contains proteomics data for 2 conditions (heterotrophic and autotrophic growth conditions) in triplicates.
Project description:Sequencing was performed to assess the ability of Nanopore direct cDNA and native RNA sequencing to characterise human transcriptomes. Total RNA was extracted from either HAP1 or HEK293 cells, and the polyA+ fraction isolated using oligodT dynabeads. Libraries were prepared using Oxford Nanopore Technologies (ONT) kits according to manufacturers instructions. Samples were then sequenced on ONT R9.4 flow cells to generate fast5 raw reads in the ONT MinKNOW software. Fast5 reads were then base-called using the ONT Albacore software to generate Fastq reads.
Project description:Rapidly increased studies by third-generation sequencing [Pacific Biosciences (Pacbio) and Oxford Nanopore Technologies (ONT)] have been used in all kinds of research areas. Among them, the plant full-length single-molecule transcriptome studies were most used by Pacbio while ONT was rarely used. Therefore, in this study, we developed ONT RNA-sequencing methods in plants. We performed a detailed evaluation of reads from Pacbio and Nanopore PCR cDNA (ONT Pc) sequencing in plants (Arabidopsis), including the characteristics of raw data and identification of transcripts. We aimed to provide a valuable reference for applications of ONT in plant transcriptome analysis.
Project description:Transposon insertion site sequencing (TIS) is a powerful method for associating genotype to phenotype. However, all TIS methods described to date use short nucleotide sequence reads which cannot uniquely determine the locations of transposon insertions within repeating genomic sequences where the repeat units are longer than the sequence read length. To overcome this limitation, we have developed a TIS method using Oxford Nanopore sequencing technology that generates and uses long nucleotide sequence reads; we have called this method LoRTIS (Long Read Transposon Insertion-site Sequencing). This experiment data contains sequence files generated using Nanopore and Illumina platforms. Biotin1308.fastq.gz and Biotin2508.fastq.gz are fastq files generated from nanopore technology. Rep1-Tn.fastq.gz and Rep1-Tn.fastq.gz are fastq files generated using Illumina platform. In this study, we have compared the efficiency of two methods in identification of transposon insertion sites.
Project description:We used the nanopore Cas9 targeted sequencing (nCATS) strategy to specifically sequence 125 L1HS-containing loci in parallel and measure their DNA methylation levels using nanopore long-read sequencing. Each targeted locus is sequenced at high coverage (~45X) with unambiguously mapped reads spanning the entire L1 element, as well as its flanking sequences over several kilobases. The genome-wide profile of L1 methylation was also assessed by bs-ATLAS-seq in the same cell lines (E-MTAB-10895).
Project description:Nanopore Sequencing and assembly of Col-0 carrying seed coat expressed GFP and RFP transgenes flanking the centromere of chromosome 3 (CTL 3.9) - additionally, DNA methylation was derived using deepsignal-plant using these reads.
Project description:To identify more targets in soybean, particularly specific targets of Cd-stress-responsive miRNAs, high-throughput degradome sequencing was used. In total, we obtained 8913111 raw reads from the library which was constructed from a mixture of four samples (HX3-CK, HX3-Cd-treatment, ZH24-CK and ZH24-Cd-treatment). After removing the reads without the CAGAG adaptor, 5430126 unique raw-reads were obtained. The unique sequences were aligned to the G. max genome database, and 6516276 reads were mapped to the genome. The mapped reads from the libraries represented 51481 annotated G. max genes.
Project description:Adenovirus is a common human pathogen that relies on host cell processes for transcription and processing of viral RNA and protein production. Although adenoviral promoters, splice junctions, and cleavage and polyadenylation sites have been characterized using low-throughput biochemical techniques or short read cDNA-based sequencing, these technologies do not fully capture the complexity of the adenoviral transcriptome. By combining Illumina short-read and nanopore long-read direct RNA sequencing approaches, we mapped transcription start sites and cleavage and polyadenylation sites across the adenovirus genome. In addition to confirming the known canonical viral early and late RNA cassettes, our analysis of splice junctions within long RNA reads revealed an additional 35 novel viral transcripts. These RNAs include fourteen new splice junctions which lead to expression of canonical open reading frames (ORF), six novel ORF-containing transcripts, and fifteen transcripts encoding for messages that potentially alter protein functions through truncations or fusion of canonical ORFs. In addition, we also detect RNAs that bypass canonical cleavage sites and generate potential chimeric proteins by linking separate gene transcription units. Of these, an evolutionary conserved protein was detected containing the N-terminus of E4orf6 fused to the downstream DBP/E2A ORF. Loss of this novel protein, E4orf6/DBP, was associated with aberrant viral replication center morphology and poor viral spread. Our work highlights how long-read sequencing technologies can reveal further complexity within viral transcriptomes.