Dataset Information

The full-length transcriptome of C. elegans using direct RNA sequencing.

ABSTRACT: Current transcriptome annotations have largely relied on short read lengths intrinsic to the most widely used high-throughput cDNA sequencing technologies. For example, in the annotation of the Caenorhabditis elegans transcriptome, more than half of the transcript isoforms lack full-length support and instead rely on inference from short reads that do not span the full length of the isoform. We applied nanopore-based direct RNA sequencing to characterize the developmental polyadenylated transcriptome of C. elegans. Taking advantage of long reads spanning the full length of mRNA transcripts, we provide support for 23,865 splice isoforms across 14,611 genes, without the need for computational reconstruction of gene models. Of the isoforms identified, 3452 are novel splice isoforms not present in the WormBase WS265 annotation. Furthermore, we identified 16,342 isoforms in the 3' untranslated region (3' UTR), 2640 of which are novel and do not fall within 10 bp of existing 3'-UTR data sets and annotations. Combining 3' UTRs and splice isoforms, we identified 28,858 full-length transcript isoforms. We also determined that poly(A) tail lengths of transcripts vary across development, as do the strengths of previously reported correlations between poly(A) tail length and expression level, and poly(A) tail length and 3'-UTR length. Finally, we have formatted this data as a publicly accessible track hub, enabling researchers to explore this data set easily in a genome browser.

SUBMITTER: Roach NP

PROVIDER: S-EPMC7050520 | biostudies-literature | 2020 Feb

REPOSITORIES: biostudies-literature

Publications

The full-length transcriptome of C. elegans using direct RNA sequencing.

Roach NP, Sadowski N, Alessi AF, Timp W, Taylor J, Kim JK

Genome Research, 2020 Feb 5, issue 2

PMID: 32024661

Similar Datasets

Direct full-length RNA sequencing reveals unexpected transcriptome complexity during Caenorhabditis elegans development.
| S-EPMC7050527 | biostudies-literature

Direct full-length RNA sequencing reveals unexpected transcriptome complexity during C. elegans development
2019-09-10 | GSE130044 | GEO

Full-length direct RNA sequencing reveals extensive remodeling of RNA expression, processing and modification in aging Caenorhabditis elegans.
| S-EPMC11662692 | biostudies-literature

Full-length direct RNA sequencing reveals extensive remodeling of RNA expression, processing and modification in aging Caenorhabditis elegans.
| S-EPMC11213008 | biostudies-literature

Full-length direct RNA sequencing uncovers stress granule-dependent RNA decay upon cellular stress.
| S-EPMC11658763 | biostudies-literature

An Improved RNA Extraction Protocol for Rye Grain Full-Length Transcriptome Sequencing.
| S-EPMC11642000 | biostudies-literature

Realizing the potential of full-length transcriptome sequencing.
| S-EPMC6792442 | biostudies-literature

Full-length transcriptome profiling for fruit development in Diospyros oleifera using nanopore sequencing.
| S-EPMC10012491 | biostudies-literature

Direct Nanopore Sequencing of Individual Full Length tRNA Strands.
| S-EPMC10189790 | biostudies-literature

Direct full-length RNA sequencing reveals unexpected transcriptome complexity during C. elegans development
| PRJNA533634 | ENA