Project description:We report isoCirc, a long-read sequencing strategy coupled with an integrated computational pipeline to characterize full-length circular RNA (circRNA isoforms) using rolling circle amplification (RCA) followed by long-read sequencing. Applying isoCirc to 12 human tissues, we determined full-length structures and examined tissue specificities of circRNA isoforms in human transcriptomes.
Project description:Deregulated gene expression is a hallmark of cancer, however most studies to date have analyzed short-read RNA-sequencing data with inherent limitations. Here, we combine PacBio long-read isoform sequencing (Iso-Seq) and Illumina paired-end short read RNA sequencing to comprehensively survey the transcriptome of gastric cancer (GC), a leading cause of global cancer mortality. We performed full-length transcriptome analysis across 10 GC cell lines covering four major GC molecular subtypes (chromosomal unstable, Epstein-Barr positive, genome stable and microsatellite unstable). We identify 60,239 non-redundant full-length transcripts, of which >66% are novel compared to current transcriptome databases. Novel isoforms are more likely to be cell-line and subtype specific, expressed at lower levels with larger number of exons, with longer isoform/coding sequence lengths. Most novel isoforms utilize an alternate first exon, and compared to other alternative splicing categories are expressed at higher levels and exhibit higher variability. Collectively, we observe alternate promoter usage in 25% of detected genes, with the majority (84.2%) of known/novel promoter pairs exhibiting potential changes in their coding sequences. Mapping these alternate promoters to TCGA GC samples, we identify several cancer-associated isoforms, including novel variants of oncogenes. Tumor-specific transcript isoforms tend to alter protein coding sequences to a larger extent than other isoforms. Analysis of outcome data suggests that novel isoforms may impart additional prognostic information. Our results provide a rich resource of full-length transcriptome data for deeper studies of GC and other gastrointestinal malignancies.
Project description:To examine the mechanisms that control flower development, we sequenced the flower bud transcriptomes of ‘High Noon’, a reblooming cultivar of P. suffruticosa × P. lutea. Both full-length isoforms and RNA-seq were sequenced in 3 floral developmental stages. A total of 15.94 Gb raw data and 457.0 million reads were generated in full-length transcript sequencing and RNA-seq.
Project description:While numerous studies have described the transcriptomes of EVs in different cellular contexts, these efforts have typically relied on sequencing methods requiring RNA fragmentation, which limits interpretations on the integrity and isoform diversity of EV-encapsulated RNA populations. Furthermore, it has been assumed that mRNA signatures in EVs are likely to be fragmentation products of the cellular mRNA material, and little is known about the extent to which full-length mRNAs are present within EVs. Using Oxford nanopore long-read RNA sequencing, we sought to characterize the full-length polyadenylated (poly-A) transcriptome of EVs released by human chronic myelogenous leukemia K562 cells. We detected 441 and 280 RNAs that were respectively enriched or depleted in EVs. EV-enriched poly-A transcripts consist of a variety of biotypes, including mRNAs, long non-coding RNAs, and pseudogenes. Our analysis revealed that 12.72% of all reads present in EVs corresponded to known full-length transcripts, 65.34% of which were mRNAs. We also observed that for many well-represented coding and non-coding genes, diverse full-length transcript isoforms were present in EV specimens, and these isoforms were reflective-of but often in different ratio compared to cellular samples. Here we report a full-length transcriptome from human EVs, as determined by long-read nanopore sequencing.
2023-09-27 | GSE225471 | GEO
Project description:Gossypium arboreum full length