
Dataset Information


A practical solution to pseudoreplication bias in single-cell studies.


ABSTRACT: Cells from the same individual share common genetic and environmental backgrounds and are not statistically independent; therefore, they are subsamples or pseudoreplicates. Thus, single-cell data have a hierarchical structure that many current single-cell methods do not address, leading to biased inference, highly inflated type 1 error rates, and reduced robustness and reproducibility. This includes methods that use a batch effect correction for individual as a means of accounting for within-sample correlation. Here, we document this dependence across a range of cell types and show that pseudo-bulk aggregation methods are conservative and underpowered relative to mixed models. To compute differential expression within a specific cell type across treatment groups, we propose applying generalized linear mixed models with a random effect for individual, to properly account for both zero inflation and the correlation structure among measures from cells within an individual. Finally, we provide power estimates across a range of experimental conditions to assist researchers in designing appropriately powered studies.
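The inflation described in the abstract can be illustrated with a small simulation. The sketch below is not the authors' method (they propose generalized linear mixed models with a random effect for individual) and it uses Gaussian toy data rather than zero-inflated counts. It only contrasts a naive test that treats every cell as independent with a test on per-individual means, under a null with no treatment effect. The function names, the variance settings, and the sample sizes are all assumptions chosen for illustration.

```python
import random
import statistics


def welch_statistic(a, b):
    """Two-sample test statistic: difference in means over its estimated standard error."""
    se = (statistics.variance(a) / len(a) + statistics.variance(b) / len(b)) ** 0.5
    return (statistics.fmean(a) - statistics.fmean(b)) / se


def simulate_false_positive_rates(n_sims=400, n_individuals=5, n_cells=50,
                                  sd_individual=1.0, sd_cell=1.0,
                                  t_crit=2.306, seed=0):
    """Return (naive_rate, aggregated_rate) when there is NO true group effect.

    t_crit is the two-sided 5% t critical value for 8 degrees of freedom,
    which matches the default of 5 individuals per group (5 + 5 - 2).
    Change it if n_individuals changes.
    """
    rng = random.Random(seed)
    z_crit = 1.96  # two-sided 5% normal critical value for the large cell-level test
    naive_hits = 0
    aggregated_hits = 0
    for _ in range(n_sims):
        groups = []
        for _group in range(2):
            individuals = []
            for _ind in range(n_individuals):
                # Shift shared by every cell of one individual: the source of correlation.
                shift = rng.gauss(0.0, sd_individual)
                individuals.append(
                    [shift + rng.gauss(0.0, sd_cell) for _ in range(n_cells)]
                )
            groups.append(individuals)

        # Naive analysis: pool all cells and treat them as independent replicates.
        cells_a = [x for ind in groups[0] for x in ind]
        cells_b = [x for ind in groups[1] for x in ind]
        if abs(welch_statistic(cells_a, cells_b)) > z_crit:
            naive_hits += 1

        # Aggregated analysis: one mean per individual, so the individual is the unit.
        means_a = [statistics.fmean(ind) for ind in groups[0]]
        means_b = [statistics.fmean(ind) for ind in groups[1]]
        if abs(welch_statistic(means_a, means_b)) > t_crit:
            aggregated_hits += 1

    return naive_hits / n_sims, aggregated_hits / n_sims


if __name__ == "__main__":
    naive_rate, aggregated_rate = simulate_false_positive_rates()
    print(f"naive cell-level false positive rate: {naive_rate:.3f}")
    print(f"per-individual false positive rate:   {aggregated_rate:.3f}")
```

In this toy setting the cell-level test rejects far more often than the nominal 5%, while the per-individual test stays near it. The abstract notes that such aggregation is conservative and underpowered relative to mixed models, which is why the authors recommend modelling the individual as a random effect instead.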

SUBMITTER: Zimmerman KD 

PROVIDER: S-EPMC7854630 | biostudies-literature | 2021 Feb

REPOSITORIES: biostudies-literature


Publications

A practical solution to pseudoreplication bias in single-cell studies.

Kip D. Zimmerman, Mark A. Espeland, Carl D. Langefeld

Nature Communications, 2021-02-02, issue 1



Similar Datasets

2017-05-11 | GSE98734 | GEO
| S-EPMC9238114 | biostudies-literature
| S-EPMC5794922 | biostudies-literature
| S-EPMC6525515 | biostudies-literature
| S-EPMC8100258 | biostudies-literature
| S-EPMC2817684 | biostudies-literature
| S-EPMC3322593 | biostudies-literature
| S-EPMC6188620 | biostudies-literature
| S-EPMC7814346 | biostudies-literature
| S-EPMC7442857 | biostudies-literature