Dataset Information

The long and the short of it: unlocking nanopore long-read RNA sequencing data with short-read differential expression analysis tools.

ABSTRACT: Application of Oxford Nanopore Technologies' long-read sequencing platform to transcriptomic analysis is increasing in popularity. However, such analysis can be challenging because of the high sequencing error rate and small library sizes, which decrease quantification accuracy and reduce power for statistical testing. Here, we report the analysis of two nanopore RNA-seq datasets with the goal of obtaining gene- and isoform-level differential expression information. A dataset of synthetic, spliced, spike-in RNAs ('sequins') and a mouse neural stem cell dataset from samples with a null mutation of the epigenetic regulator Smchd1 were analysed using a mix of long-read specific tools for preprocessing together with established short-read RNA-seq methods for downstream analysis. We used limma-voom to perform differential gene expression analysis, and the novel FLAMES pipeline to perform isoform identification and quantification, followed by DRIMSeq and limma-diffSplice (with stageR) to perform differential transcript usage analysis. We compared results from the sequins dataset to the ground truth, and results from the mouse dataset to a previous short-read study on equivalent samples. Overall, our work shows that transcriptomic analysis of long-read nanopore data using long-read specific preprocessing methods together with short-read differential expression methods and software that are already in wide use can yield meaningful results.
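
The differential transcript usage analysis mentioned in the abstract compares how a gene's reads are split among its isoforms between conditions. The Python sketch below illustrates only that underlying quantity, under stated assumptions: the function name `transcript_usage`, the transcript and gene identifiers, and the counts are all hypothetical, and the code does not reproduce the statistical models of DRIMSeq, limma-diffSplice, or stageR.

```python
from collections import defaultdict


def transcript_usage(counts, tx2gene):
    """Return each transcript's share of its gene's total count in one sample."""
    gene_totals = defaultdict(float)
    for tx, n in counts.items():
        gene_totals[tx2gene[tx]] += n

    usage = {}
    for tx, n in counts.items():
        total = gene_totals[tx2gene[tx]]
        # A gene with zero reads has undefined usage; report None rather than guess.
        usage[tx] = n / total if total > 0 else None
    return usage


# Hypothetical toy data: one gene with two isoforms, one sample per condition.
tx2gene = {"txA": "gene1", "txB": "gene1"}
wild_type = {"txA": 80, "txB": 20}
mutant = {"txA": 30, "txB": 70}

usage_wt = transcript_usage(wild_type, tx2gene)
usage_mut = transcript_usage(mutant, tx2gene)

# Change in usage per transcript (mutant minus wild type).
delta = {tx: usage_mut[tx] - usage_wt[tx] for tx in tx2gene}
print(delta)
```

In this toy case the gene's total count is the same in both samples, so a gene-level test would see no change, while the isoform proportions shift substantially. A real analysis would use replicate samples and a formal test rather than a raw difference of proportions.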

SUBMITTER: Dong X

PROVIDER: S-EPMC8074342 | biostudies-literature | 2021 Jun

REPOSITORIES: biostudies-literature

Publications

The long and the short of it: unlocking nanopore long-read RNA sequencing data with short-read differential expression analysis tools.

Dong X, Tian L, Gouil Q, Kariyawasam H, Su S, De Paoli-Iseppi R, Prawer YDJ, Clark MB, Breslin K, Iminitoff M, Blewitt ME, Law CW, Ritchie ME

NAR Genomics and Bioinformatics, 2021 Apr 26, issue 2

PMID: 33937765

Similar Datasets

Unlocking short read sequencing for metagenomics. S-EPMC2911387 | biostudies-literature

Exploring differential exon usage via short- and long-read RNA sequencing strategies. S-EPMC9516339 | biostudies-literature

Long-read single-molecule RNA structure sequencing using nanopore. S-EPMC9723614 | biostudies-literature

NanoGalaxy: Nanopore long-read sequencing data analysis in Galaxy. S-EPMC7568507 | biostudies-literature

Oxford nanopore long-read sequencing enables the generation of complete bacterial and plasmid genomes without short-read sequencing. S-EPMC10225699 | biostudies-literature

Generation of full-length circular RNA libraries for Oxford Nanopore long-read sequencing. S-EPMC9451095 | biostudies-literature

Benchmarking short and long read polishing tools for nanopore assemblies: achieving near-perfect genomes for outbreak isolates. S-EPMC11232133 | biostudies-literature

Engineering psychrophilic polymerase for nanopore long-read sequencing. S-EPMC11246872 | biostudies-literature

Differential gene expression analysis tools exhibit substandard performance for long non-coding RNA-sequencing data. S-EPMC6058388 | biostudies-literature

NanoMod: a computational tool to detect DNA modifications using Nanopore long-read sequencing data. S-EPMC6360650 | biostudies-literature