Dataset Information

Comparison of alternative approaches for analysing multi-level RNA-seq data.

ABSTRACT: RNA sequencing (RNA-seq) is widely used for RNA quantification in the environmental, biological and medical sciences. It enables the description of genome-wide patterns of expression and the identification of regulatory interactions and networks. The aim of RNA-seq data analyses is to achieve rigorous quantification of genes/transcripts to allow a reliable prediction of differential expression (DE), despite variation in levels of noise and inherent biases in sequencing data. This can be especially challenging for datasets in which gene expression differences are subtle, as in the behavioural transcriptomics test dataset from D. melanogaster that we used here. We investigated the power of existing approaches for quality checking mRNA-seq data and explored additional, quantitative quality checks. To accommodate nested, multi-level experimental designs, we incorporated sample layout into our analyses. We employed a subsampling without replacement-based normalization and an identification of DE that accounted for the hierarchy and amplitude of effect sizes within samples, then evaluated the resulting differential expression call in comparison to existing approaches. In a final step to test for broader applicability, we applied our approaches to a published set of H. sapiens mRNA-seq samples, The dataset-tailored methods improved sample comparability and delivered a robust prediction of subtle gene expression changes. The proposed approaches have the potential to improve key steps in the analysis of RNA-seq data by incorporating the structure and characteristics of biological experiments.

SUBMITTER: Mohorianu I

PROVIDER: S-EPMC5549751 | biostudies-literature | 2017

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

Comparison of alternative approaches for analysing multi-level RNA-seq data.

Mohorianu Irina I Bretman Amanda A Smith Damian T DT Fowler Emily K EK Dalmay Tamas T Chapman Tracey T

PloS one 20170808 8

RNA sequencing (RNA-seq) is widely used for RNA quantification in the environmental, biological and medical sciences. It enables the description of genome-wide patterns of expression and the identification of regulatory interactions and networks. The aim of RNA-seq data analyses is to achieve rigorous quantification of genes/transcripts to allow a reliable prediction of differential expression (DE), despite variation in levels of noise and inherent biases in sequencing data. This can be especial ...[more]

PMID: 28792517

Dataset Information

Comparison of alternative approaches for analysing multi-level RNA-seq data.

Publications

Comparison of alternative approaches for analysing multi-level RNA-seq data.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

Approaches for sRNA Analysis of Human RNA-Seq Data: Comparison, Benchmarking.
| S-EPMC9959513 | biostudies-literature

Quantitative visualization of alternative exon expression from RNA-seq data.
| S-EPMC4542614 | biostudies-literature

Comparison of transformations for single-cell RNA-seq data.
| S-EPMC10172138 | biostudies-literature

Comparative evaluation of gene set analysis approaches for RNA-Seq data.
| S-EPMC4265362 | biostudies-literature

GeneTEFlow: A Nextflow-based pipeline for analysing gene and transposable elements expression from RNA-Seq data.
| S-EPMC7458328 | biostudies-literature

KISSPLICE: de-novo calling alternative splicing events from RNA-seq data.
| S-EPMC3358658 | biostudies-literature

Detecting Allele-Specific Alternative Splicing from Population-Scale RNA-Seq Data.
| S-EPMC7477012 | biostudies-literature

Profiling Alternative 3' Untranslated Regions in Sorghum using RNA-seq Data.
| S-EPMC7649775 | biostudies-literature

BayesPeak--an R package for analysing ChIP-seq data.
| S-EPMC3042177 | biostudies-literature

Methodological approaches for analysing data from therapeutic efficacy studies.
| S-EPMC8139079 | biostudies-literature