Ontology highlight
ABSTRACT:
SUBMITTER: Kim TH
PROVIDER: S-EPMC7412673 | biostudies-literature | 2020 Aug
REPOSITORIES: biostudies-literature
Kim Tae Hyun TH Zhou Xiang X Chen Mengjie M
Genome biology 20200806 1
Many existing pipelines for scRNA-seq data apply pre-processing steps such as normalization or imputation to account for excessive zeros or "drop-outs." Here, we extensively analyze diverse UMI data sets to show that clustering should be the foremost step of the workflow. We observe that most drop-outs disappear once cell-type heterogeneity is resolved, while imputing or normalizing heterogeneous data can introduce unwanted noise. We propose a novel framework HIPPO (Heterogeneity-Inspired Pre-Pr ...[more]