Dataset Information

Probabilistic modeling of bifurcations in single-cell gene expression data using a Bayesian mixture of factor analyzers.


ABSTRACT: Modeling bifurcations in single-cell transcriptomics data has become an increasingly popular field of research. Several methods have been proposed to infer bifurcation structure from such data, but all rely on heuristic, non-probabilistic inference. Here we propose the first generative, fully probabilistic model for such inference, based on a Bayesian hierarchical mixture of factor analyzers. Our model exhibits competitive performance on large datasets despite implementing full Markov chain Monte Carlo sampling, and its unique hierarchical prior structure enables automatic determination of the genes driving the bifurcation process. We additionally propose an empirical-Bayes-like extension that deals with the high levels of zero-inflation in single-cell RNA-seq data, and we quantify when such models are useful. We apply our model to both real and simulated single-cell gene expression data and compare the results to existing pseudotime methods. Finally, we discuss both the merits and weaknesses of such a unified, probabilistic approach in the context of practical bioinformatics analyses.
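As a rough illustration of what a generative mixture of factor analyzers for a bifurcation can look like, the sketch below simulates toy expression data in which each cell has a one-dimensional latent pseudotime and a branch label, and each gene has a branch-specific intercept and slope. This is not the authors' model or code: the function name `simulate_bifurcation`, the Gaussian noise, the standard-normal draws for all parameters, and the absence of any hierarchical prior or zero-inflation component are simplifying assumptions made here for illustration only.

```python
import numpy as np


def simulate_bifurcation(n_cells=200, n_genes=5, n_branches=2,
                         noise_sd=0.1, seed=0):
    """Simulate toy expression data from a simple mixture of factor analyzers.

    Hypothetical sketch: every distributional choice below is an assumption,
    not taken from the dataset record or the associated paper.
    """
    rng = np.random.default_rng(seed)

    # One-dimensional latent factor per cell, interpreted as pseudotime.
    pseudotime = rng.normal(0.0, 1.0, size=n_cells)

    # Mixture component per cell, interpreted as the branch it follows.
    branch = rng.integers(0, n_branches, size=n_cells)

    # Branch-specific intercept and factor loading (slope) for each gene.
    intercept = rng.normal(0.0, 1.0, size=(n_branches, n_genes))
    slope = rng.normal(0.0, 1.0, size=(n_branches, n_genes))

    # Expected expression: linear in pseudotime, with parameters chosen
    # according to each cell's branch.
    mean = intercept[branch] + slope[branch] * pseudotime[:, None]

    # Observed expression: mean plus independent Gaussian noise.
    expression = mean + rng.normal(0.0, noise_sd, size=mean.shape)
    return expression, pseudotime, branch


expression, pseudotime, branch = simulate_bifurcation()
```

Inference in such a model runs in the opposite direction: given only `expression`, one would estimate `pseudotime`, `branch`, and the gene-level parameters, for example by Markov chain Monte Carlo sampling as the abstract describes.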

SUBMITTER: Campbell KR 

PROVIDER: S-EPMC5428745 | biostudies-literature

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC2367561 | biostudies-literature
| S-EPMC7954949 | biostudies-literature
| S-EPMC7307974 | biostudies-literature
| S-EPMC6858378 | biostudies-literature
| S-EPMC3649281 | biostudies-literature
2019-03-09 | GSE128066 | GEO
| S-EPMC10257482 | biostudies-literature
| S-EPMC7487589 | biostudies-literature
| S-EPMC6456731 | biostudies-literature
| S-EPMC2796715 | biostudies-literature