Dataset Information

Using Poisson mixed-effects model to quantify transcript-level gene expression in RNA-Seq.

ABSTRACT:

Motivation

RNA sequencing (RNA-Seq) is a powerful new technology for mapping and quantifying transcriptomes using ultra high-throughput next-generation sequencing technologies. Using deep sequencing, gene expression levels of all transcripts including novel ones can be quantified digitally. Although extremely promising, the massive amounts of data generated by RNA-Seq, substantial biases and uncertainty in short read alignment pose challenges for data analysis. In particular, large base-specific variation and between-base dependence make simple approaches, such as those that use averaging to normalize RNA-Seq data and quantify gene expressions, ineffective.

Results

In this study, we propose a Poisson mixed-effects (POME) model to characterize base-level read coverage within each transcript. The underlying expression level is included as a key parameter in this model. Since the proposed model is capable of incorporating base-specific variation as well as between-base dependence that affect read coverage profile throughout the transcript, it can lead to improved quantification of the true underlying expression level.

Availability and implementation

POME can be freely downloaded at http://www.stat.purdue.edu/~yuzhu/pome.html.

Contact

yuzhu@purdue.edu; zhaohui.qin@emory.edu

Supplementary information

Supplementary data are available at Bioinformatics online.

SUBMITTER: Hu M

PROVIDER: S-EPMC3244770 | biostudies-literature | 2012 Jan

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

Using Poisson mixed-effects model to quantify transcript-level gene expression in RNA-Seq.

Hu Ming M Zhu Yu Y Taylor Jeremy M G JM Liu Jun S JS Qin Zhaohui S ZS

Bioinformatics (Oxford, England) 20111108 1

<h4>Motivation</h4>RNA sequencing (RNA-Seq) is a powerful new technology for mapping and quantifying transcriptomes using ultra high-throughput next-generation sequencing technologies. Using deep sequencing, gene expression levels of all transcripts including novel ones can be quantified digitally. Although extremely promising, the massive amounts of data generated by RNA-Seq, substantial biases and uncertainty in short read alignment pose challenges for data analysis. In particular, large base- ...[more]

PMID: 22072384

Dataset Information

Using Poisson mixed-effects model to quantify transcript-level gene expression in RNA-Seq.

Motivation

Results

Availability and implementation

Contact

Supplementary information

Publications

Using Poisson mixed-effects model to quantify transcript-level gene expression in RNA-Seq.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

Transcript-level expression analysis of RNA-seq experiments with HISAT, StringTie and Ballgown.
| S-EPMC5032908 | biostudies-literature

Differential analyses for RNA-seq: transcript-level estimates improve gene-level inferences.
| S-EPMC4712774 | biostudies-literature

Modelling RNA-Seq data with a zero-inflated mixture Poisson linear model.
| S-EPMC6763381 | biostudies-literature

Polyester: simulating RNA-seq datasets with differential transcript expression.
| S-EPMC4635655 | biostudies-literature

A longitudinal Bayesian mixed effects model with hurdle Conway-Maxwell-Poisson distribution.
| S-EPMC9167575 | biostudies-literature

A Poisson Log-Normal Model for Constructing Gene Covariation Network Using RNA-seq Data.
| S-EPMC5510689 | biostudies-literature

A two-parameter generalized Poisson model to improve the analysis of RNA-seq data.
| S-EPMC2943596 | biostudies-literature

Sample size calculation for differential expression analysis of RNA-seq data under Poisson distribution.
| S-EPMC3874726 | biostudies-literature

Power analysis for RNA-Seq differential expression studies using generalized linear mixed effects models.
| S-EPMC7236949 | biostudies-literature

GeneFriends: a human RNA-seq-based gene and transcript co-expression database.
| S-EPMC4383890 | biostudies-literature