Dataset Information

Automated identification of reference genes based on RNA-seq data.

ABSTRACT: Gene expression analyses demand appropriate reference genes (RGs) for normalization, in order to obtain reliable assessments. Ideally, RG expression levels should remain constant in all cells, tissues or experimental conditions under study. Housekeeping genes traditionally fulfilled this requirement, but they have been reported to be less invariant than expected; therefore, RGs should be tested and validated for every particular situation. Microarray data have been used to propose new RGs, but only a limited set of model species and conditions are available; on the contrary, RNA-seq experiments are more and more frequent and constitute a new source of candidate RGs.An automated workflow based on mapped NGS reads has been constructed to obtain highly and invariantly expressed RGs based on a normalized expression in reads per mapped million and the coefficient of variation. This workflow has been tested with Roche/454 reads from reproductive tissues of olive tree (Olea europaea L.), as well as with Illumina paired-end reads from two different accessions of Arabidopsis thaliana and three different human cancers (prostate, small-cell cancer lung and lung adenocarcinoma). Candidate RGs have been proposed for each species and many of them have been previously reported as RGs in literature. Experimental validation of significant RGs in olive tree is provided to support the algorithm.Regardless sequencing technology, number of replicates, and library sizes, when RNA-seq experiments are designed and performed, the same datasets can be analyzed with our workflow to extract suitable RGs for subsequent PCR validation. Moreover, different subset of experimental conditions can provide different suitable RGs.

SUBMITTER: Carmona R

PROVIDER: S-EPMC5568602 | biostudies-literature | 2017 Aug

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

Automated identification of reference genes based on RNA-seq data.

Carmona Rosario R Arroyo Macarena M Jiménez-Quesada María José MJ Seoane Pedro P Zafra Adoración A Larrosa Rafael R Alché Juan de Dios JD Claros M Gonzalo MG

Biomedical engineering online 20170818 Suppl 1

<h4>Background</h4>Gene expression analyses demand appropriate reference genes (RGs) for normalization, in order to obtain reliable assessments. Ideally, RG expression levels should remain constant in all cells, tissues or experimental conditions under study. Housekeeping genes traditionally fulfilled this requirement, but they have been reported to be less invariant than expected; therefore, RGs should be tested and validated for every particular situation. Microarray data have been used to pro ...[more]

PMID: 28830520

Dataset Information

Automated identification of reference genes based on RNA-seq data.

Publications

Automated identification of reference genes based on RNA-seq data.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

Identification of potential genes for human ischemic cardiomyopathy based on RNA-Seq data.
| S-EPMC5347674 | biostudies-literature

Systematic Selection of Reference Genes for the Normalization of Circulating RNA Transcripts in Pregnant Women Based on RNA-Seq Data.
| S-EPMC5578099 | biostudies-literature

Comparing reference-based RNA-Seq mapping methods for non-human primate data.
| S-EPMC4112205 | biostudies-literature

RNA-seq-based selection of reference genes for RT-qPCR analysis of pitaya.
| S-EPMC6668369 | biostudies-literature

Identification and Validation of Reference Genes in <i>Clostridium beijerinckii</i> NRRL B-598 for RT-qPCR Using RNA-Seq Data.
| S-EPMC8012504 | biostudies-literature

Using RNA-seq data to select reference genes for normalizing gene expression in apple roots.
| S-EPMC5608369 | biostudies-literature

Using RNA-Seq Data to Evaluate Reference Genes Suitable for Gene Expression Studies in Soybean.
| S-EPMC4562714 | biostudies-literature

Tximeta: Reference sequence checksums for provenance identification in RNA-seq.
| S-EPMC7059966 | biostudies-literature

Identification of two putative reference genes from grapevine suitable for gene expression analysis in berry and related tissues derived from RNA-Seq data.
| S-EPMC3878734 | biostudies-literature

ARH-seq: identification of differential splicing in RNA-seq data.
| S-EPMC4132698 | biostudies-literature