Dataset Information

A long-read RNA-seq approach to identify novel transcripts of very large genes.

ABSTRACT: RNA-seq is widely used for studying gene expression, but commonly used sequencing platforms produce short reads that only span up to two exon junctions per read. This makes it difficult to accurately determine the composition and phasing of exons within transcripts. Although long-read sequencing improves this issue, it is not amenable to precise quantitation, which limits its utility for differential expression studies. We used long-read isoform sequencing combined with a novel analysis approach to compare alternative splicing of large, repetitive structural genes in muscles. Analysis of muscle structural genes that produce medium (Nrap: 5 kb), large (Neb: 22 kb), and very large (Ttn: 106 kb) transcripts in cardiac muscle, and fast and slow skeletal muscles identified unannotated exons for each of these ubiquitous muscle genes. This also identified differential exon usage and phasing for these genes between the different muscle types. By mapping the in-phase transcript structures to known annotations, we also identified and quantified previously unannotated transcripts. Results were confirmed by endpoint PCR and Sanger sequencing, which revealed muscle-type-specific differential expression of these novel transcripts. The improved transcript identification and quantification shown by our approach removes previous impediments to studies aimed at quantitative differential expression of ultralong transcripts.

SUBMITTER: Uapinyoying P

PROVIDER: S-EPMC7370890 | biostudies-literature | 2020 Jun

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

A long-read RNA-seq approach to identify novel transcripts of very large genes.

Uapinyoying Prech P Goecks Jeremy J Knoblach Susan M SM Panchapakesan Karuna K Bonnemann Carsten G CG Partridge Terence A TA Jaiswal Jyoti K JK Hoffman Eric P EP

Genome research 20200601 6

RNA-seq is widely used for studying gene expression, but commonly used sequencing platforms produce short reads that only span up to two exon junctions per read. This makes it difficult to accurately determine the composition and phasing of exons within transcripts. Although long-read sequencing improves this issue, it is not amenable to precise quantitation, which limits its utility for differential expression studies. We used long-read isoform sequencing combined with a novel analysis approach ...[more]

PMID: 32660935

Dataset Information

A long-read RNA-seq approach to identify novel transcripts of very large genes.

Publications

A long-read RNA-seq approach to identify novel transcripts of very large genes.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

A long-read RNA-seq approach to identify novel transcripts of very large genes
2020-05-17 | GSE138362 | GEO

A long-read RNA-seq approach to identify novel transcripts of very large genes
| PRJNA575604 | ENA

De novo annotation of lncRNA HOTAIR transcripts by long-read RNA capture-seq reveals a differentiation-driven isoform switch.
| S-EPMC9482196 | biostudies-literature

Targeted DNA-seq and RNA-seq of Reference Samples with Short-read and Long-read Sequencing.
| S-EPMC11329654 | biostudies-literature

L-GIREMI uncovers RNA editing sites in long-read RNA-seq.
| S-EPMC10360234 | biostudies-literature

Transcriptome assembly from long-read RNA-seq alignments with StringTie2.
| S-EPMC6912988 | biostudies-literature

Evaluation of tools for long read RNA-seq splice-aware alignment.
| S-EPMC6192213 | biostudies-literature

Extension of human lncRNA transcripts by RACE coupled with long-read high-throughput sequencing (RACE-Seq).
| S-EPMC4992054 | biostudies-literature

PSI-Sigma: a comprehensive splicing-detection method for short-read and long-read RNA-seq analysis.
| S-EPMC6901072 | biostudies-literature

Context-aware transcript quantification from long-read RNA-seq data with Bambu.
| S-EPMC10448944 | biostudies-literature