Statistics or biology: the zero-inflation controversy about scRNA-seq data.
ABSTRACT: Researchers interpret the abundant zeros in single-cell RNA-seq (scRNA-seq) data differently: some regard zeros as biological signals representing no or low gene expression, while others regard them as missing data to be corrected. To help address this controversy, we discuss the sources of biological and non-biological zeros; introduce five mechanisms for adding non-biological zeros in computational benchmarking; evaluate the impact of non-biological zeros on data analysis; benchmark three input data types (observed counts, imputed counts, and binarized counts); discuss open questions regarding non-biological zeros; and advocate for transparent analysis.
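To make two of the terms in the abstract concrete, the following is a minimal sketch, not the authors' method. It assumes a toy Poisson count matrix (cells by genes), and the helper names `binarize_counts`, `add_random_zeros`, and `zero_fraction` are hypothetical. The uniform random masking shown here is only one simple way to introduce non-biological zeros for benchmarking; it is not claimed to be one of the five mechanisms the article describes.

```python
import numpy as np


def binarize_counts(counts):
    """Return a 0/1 matrix: 1 where a gene has a nonzero count in a cell."""
    return (np.asarray(counts) > 0).astype(np.int8)


def add_random_zeros(counts, mask_rate, rng):
    """Set each entry to zero independently with probability mask_rate.

    Hypothetical masking scheme for illustration only; it is an assumption,
    not necessarily one of the mechanisms discussed in the article.
    """
    counts = np.asarray(counts)
    keep = rng.random(counts.shape) >= mask_rate  # True where the entry survives
    return counts * keep


def zero_fraction(counts):
    """Proportion of entries in the matrix that are exactly zero."""
    return float(np.mean(np.asarray(counts) == 0))


rng = np.random.default_rng(0)
observed = rng.poisson(lam=2.0, size=(200, 50))  # toy cells x genes count matrix
masked = add_random_zeros(observed, mask_rate=0.3, rng=rng)
binarized = binarize_counts(masked)

print("zero fraction, observed:", zero_fraction(observed))
print("zero fraction, masked:  ", zero_fraction(masked))
```

Masking can only turn nonzero entries into zeros, so the zero fraction of `masked` is never lower than that of `observed`. The binarized matrix keeps only whether each gene was detected in each cell, which is the information a binarized-count analysis works from.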
SUBMITTER: Jiang R
PROVIDER: S-EPMC8783472 | biostudies-literature
REPOSITORIES: biostudies-literature