Project description:Long-read Nanopore cDNA sequencing of polyA-enriched RNA was implemented in a range of adult tissues isolated from cattle, pig, and chicken. These data were used to identify and characterize the expression patterns of full-length transcript isoforms.
Project description:This dataset contains Xdrop followed by oxford nanopore long read sequencing performed in target tRNA gene deletion (t8) and intergenic region deletion (i50) clones in HepG2 . By applying de novo assembly based approach to Xdrop-LRS data, we identified Cas9-induced on-target genomic alteration.
Project description:This dataset contains Xdrop followed by oxford nanopore long read sequencing performed in target tRNA gene deletion clones in HAP1 (t72) and HepG2 (t15). By applying de novo assembly based approach to Xdrop-LRS data, we identified Cas9-induced on-target genomic alteration.
Project description:Evaluation of short-read-only, long-read-only, and hybrid assembly approaches on metagenomic samples demonstrating how they affect gene and protein prediction which is relevant for downstream functional analyses. For a human gut microbiome sample, we use complementary metatranscriptomic, and metaproteomic data to evaluate the metagenomic-based protein predictions.
Project description:Purpose: To generate a reference long-read transcriptomic data set for use in developing new analysis pipelines and comparing their performance with existing methods. Synthetic “sequin” RNA standards (Hardwick et al. 2016) were sequenced using the Oxford Nanopore Technologies (ONT) GridION platform.
Project description:Ongoing improvements to next generation sequencing technologies are leading to longer sequencing read lengths, but a thorough understanding of the impact of longer reads on RNA sequencing analyses is lacking. To address this issue, we generated and compared two RNA sequencing datasets of differing read lengths -- 2x75 bp (L75) and 2x262 bp (L262) -- and investigated the impact of read length on various aspects of analysis, including the performance of currently available read-mapping tools, gene and transcript quantification, and detection of allele-specific expression patterns. Our results indicate that, while the scalability of read-mapping tools and the cost-effectiveness of long read protocol is an issue that requires further attention, longer reads enable more accurate quantification of diverse aspects of gene expression, including individual-specific patterns of allele-specific expression and alternative splicing. Two RNA-Seq datasets of differing read lengths (2x262 bp and 2x75 bp)
Project description:Adenovirus is a common human pathogen that relies on host cell processes for transcription and processing of viral RNA and protein production. Although adenoviral promoters, splice junctions, and cleavage and polyadenylation sites have been characterized using low-throughput biochemical techniques or short read cDNA-based sequencing, these technologies do not fully capture the complexity of the adenoviral transcriptome. By combining Illumina short-read and nanopore long-read direct RNA sequencing approaches, we mapped transcription start sites and cleavage and polyadenylation sites across the adenovirus genome. In addition to confirming the known canonical viral early and late RNA cassettes, our analysis of splice junctions within long RNA reads revealed an additional 35 novel viral transcripts. These RNAs include fourteen new splice junctions which lead to expression of canonical open reading frames (ORF), six novel ORF-containing transcripts, and fifteen transcripts encoding for messages that potentially alter protein functions through truncations or fusion of canonical ORFs. In addition, we also detect RNAs that bypass canonical cleavage sites and generate potential chimeric proteins by linking separate gene transcription units. Of these, an evolutionary conserved protein was detected containing the N-terminus of E4orf6 fused to the downstream DBP/E2A ORF. Loss of this novel protein, E4orf6/DBP, was associated with aberrant viral replication center morphology and poor viral spread. Our work highlights how long-read sequencing technologies can reveal further complexity within viral transcriptomes.