Unknown

Dataset Information

0

Dispersion estimation and its effect on test performance in RNA-seq data analysis: a simulation-based comparison of methods.


ABSTRACT: A central goal of RNA sequencing (RNA-seq) experiments is to detect differentially expressed genes. In the ubiquitous negative binomial model for RNA-seq data, each gene is given a dispersion parameter, and correctly estimating these dispersion parameters is vital to detecting differential expression. Since the dispersions control the variances of the gene counts, underestimation may lead to false discovery, while overestimation may lower the rate of true detection. After briefly reviewing several popular dispersion estimation methods, this article describes a simulation study that compares them in terms of point estimation and the effect on the performance of tests for differential expression. The methods that maximize the test performance are the ones that use a moderate degree of dispersion shrinkage: the DSS, Tagwise wqCML, and Tagwise APL. In practical RNA-seq data analysis, we recommend using one of these moderate-shrinkage methods with the QLShrink test in QuasiSeq R package.

SUBMITTER: Landau WM 

PROVIDER: S-EPMC3857202 | biostudies-literature | 2013

REPOSITORIES: biostudies-literature

altmetric image

Publications

Dispersion estimation and its effect on test performance in RNA-seq data analysis: a simulation-based comparison of methods.

Landau William Michael WM   Liu Peng P  

PloS one 20131209 12


A central goal of RNA sequencing (RNA-seq) experiments is to detect differentially expressed genes. In the ubiquitous negative binomial model for RNA-seq data, each gene is given a dispersion parameter, and correctly estimating these dispersion parameters is vital to detecting differential expression. Since the dispersions control the variances of the gene counts, underestimation may lead to false discovery, while overestimation may lower the rate of true detection. After briefly reviewing sever  ...[more]

Similar Datasets

| S-EPMC4302049 | biostudies-literature
| S-EPMC7192453 | biostudies-literature
| S-EPMC3608160 | biostudies-literature
| S-EPMC8021860 | biostudies-literature
| S-EPMC6134335 | biostudies-literature
| S-EPMC4435722 | biostudies-literature
| S-EPMC7325690 | biostudies-literature
| S-EPMC3543158 | biostudies-literature
| S-EPMC3656776 | biostudies-literature
| S-EPMC4983420 | biostudies-literature