Unknown

Dataset Information

0

Identifying Differentially Expressed Genes of Zero Inflated Single Cell RNA Sequencing Data Using Mixed Model Score Tests.


ABSTRACT: Single cell RNA sequencing (scRNA-seq) allows quantitative measurement and comparison of gene expression at the resolution of single cells. Ignoring the batch effects and zero inflation of scRNA-seq data, many proposed differentially expressed (DE) methods might generate bias. We propose a method, single cell mixed model score tests (scMMSTs), to efficiently identify DE genes of scRNA-seq data with batch effects using the generalized linear mixed model (GLMM). scMMSTs treat the batch effect as a random effect. For zero inflation, scMMSTs use a weighting strategy to calculate observational weights for counts independently under zero-inflated and zero-truncated distributions. Counts data with calculated weights were subsequently analyzed using weighted GLMMs. The theoretical null distributions of the score statistics were constructed by mixed Chi-square distributions. Intensive simulations and two real datasets were used to compare edgeR-zinbwave, DESeq2-zinbwave, and scMMSTs. Our study demonstrates that scMMSTs, as supplement to standard methods, are advantageous to define DE genes of zero-inflated scRNA-seq data with batch effects.

SUBMITTER: He Z 

PROVIDER: S-EPMC7894898 | biostudies-literature | 2021

REPOSITORIES: biostudies-literature

altmetric image

Publications

Identifying Differentially Expressed Genes of Zero Inflated Single Cell RNA Sequencing Data Using Mixed Model Score Tests.

He Zhiqiang Z   Pan Yueyun Y   Shao Fang F   Wang Hui H  

Frontiers in genetics 20210205


Single cell RNA sequencing (scRNA-seq) allows quantitative measurement and comparison of gene expression at the resolution of single cells. Ignoring the batch effects and zero inflation of scRNA-seq data, many proposed differentially expressed (DE) methods might generate bias. We propose a method, single cell mixed model score tests (scMMSTs), to efficiently identify DE genes of scRNA-seq data with batch effects using the generalized linear mixed model (GLMM). scMMSTs treat the batch effect as a  ...[more]

Similar Datasets

| S-EPMC6940275 | biostudies-literature
| S-EPMC5763373 | biostudies-other
| S-EPMC7652264 | biostudies-literature
| S-EPMC9939047 | biostudies-literature
| S-EPMC5772010 | biostudies-literature
| S-EPMC8477913 | biostudies-literature
| S-EPMC6491826 | biostudies-literature
| S-EPMC3381971 | biostudies-literature
| S-EPMC5592911 | biostudies-literature
| S-EPMC6763381 | biostudies-literature