Unknown

Dataset Information

0

Variance component testing for identifying differentially expressed genes in RNA-seq data.


ABSTRACT: RNA sequencing (RNA-Seq) enables the measurement and comparison of gene expression with isoform-level quantification. Differences in the effect of each isoform may make traditional methods, which aggregate isoforms, ineffective. Here, we introduce a variance component-based test that can jointly test multiple isoforms of one gene to identify differentially expressed (DE) genes, especially those with isoforms that have differential effects. We model isoform-level expression data from RNA-Seq using a negative binomial distribution and consider the baseline abundance of isoforms and their effects as two random terms. Our approach tests the global null hypothesis of no difference in any of the isoforms. The null distribution of the derived score statistic is investigated using empirical and theoretical methods. The results of simulations suggest that the performance of the proposed set test is superior to that of traditional algorithms and almost reaches optimal power when the variance of covariates is large. This method is also applied to analyze real data. Our algorithm, as a supplement to traditional algorithms, is superior at selecting DE genes with sparse or opposite effects for isoforms.

SUBMITTER: Yang S 

PROVIDER: S-EPMC5592911 | biostudies-literature |

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC3381971 | biostudies-literature
| S-EPMC5178351 | biostudies-literature
| S-EPMC6604381 | biostudies-literature
| S-EPMC4804494 | biostudies-literature
| S-EPMC9670425 | biostudies-literature
| S-EPMC6284200 | biostudies-literature
| S-EPMC8234728 | biostudies-literature
| S-EPMC4827276 | biostudies-other
| S-EPMC7894898 | biostudies-literature
| S-EPMC5862256 | biostudies-literature