Dataset Information

Modeling Persistent Trends in Distributions.


ABSTRACT: We present a nonparametric framework to model a short sequence of probability distributions that vary both due to underlying effects of sequential progression and confounding noise. To distinguish between these two types of variation and estimate the sequential-progression effects, our approach leverages an assumption that these effects follow a persistent trend. This work is motivated by the recent rise of single-cell RNA-sequencing experiments over a brief time course, which aim to identify genes relevant to the progression of a particular biological process across diverse cell populations. While classical statistical tools focus on scalar-response regression or order-agnostic differences between distributions, it is desirable in this setting to consider both the full distributions as well as the structure imposed by their ordering. We introduce a new regression model for ordinal covariates where responses are univariate distributions and the underlying relationship reflects consistent changes in the distributions over increasing levels of the covariate. This concept is formalized as a trend in distributions, which we define as an evolution that is linear under the Wasserstein metric. Implemented via a fast alternating projections algorithm, our method exhibits numerous strengths in simulations and analyses of single-cell gene expression data.
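As a rough illustration of the idea in the abstract (not the authors' alternating projections algorithm), the sketch below relies on the standard fact that, for univariate distributions, the 2-Wasserstein distance equals the L2 distance between quantile functions. It then fits a straight line through the empirical quantiles across covariate levels, which is one simple reading of "linear under the Wasserstein metric" and is an assumption made here. The function names, the quantile grid `probs`, and the least-squares fit are all hypothetical choices for illustration.

```python
import numpy as np


def empirical_quantiles(samples, probs):
    """Empirical quantile function of one sample, evaluated on a grid of probabilities."""
    return np.quantile(np.asarray(samples, dtype=float), probs)


def wasserstein2_1d(q_a, q_b):
    """Approximate 2-Wasserstein distance between two univariate distributions,
    computed as the root-mean-square difference of their quantile functions
    evaluated on the same (roughly uniform) probability grid."""
    return float(np.sqrt(np.mean((np.asarray(q_a) - np.asarray(q_b)) ** 2)))


def fit_linear_quantile_trend(sample_sets, levels, probs):
    """Least-squares line through the quantiles at each probability level.

    sample_sets: list of 1-D arrays, one per ordinal covariate level
    levels:      numeric values of the covariate levels (e.g. 0, 1, 2, ...)
    probs:       probabilities at which quantiles are compared

    Returns (fitted, observed), both of shape (n_levels, n_probs).
    Note: this simplified fit does not enforce that fitted quantiles stay
    monotone in the probability, which a full method would need to handle.
    """
    observed = np.stack([empirical_quantiles(s, probs) for s in sample_sets])
    design = np.column_stack([np.ones(len(levels)), np.asarray(levels, dtype=float)])
    coef, *_ = np.linalg.lstsq(design, observed, rcond=None)
    return design @ coef, observed


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    levels = [0, 1, 2, 3]
    probs = np.linspace(0.05, 0.95, 19)
    # Synthetic data: the mean shifts steadily with the level, plus sampling noise.
    sample_sets = [rng.normal(loc=0.5 * t, scale=1.0, size=500) for t in levels]
    fitted, observed = fit_linear_quantile_trend(sample_sets, levels, probs)
    for t, f, o in zip(levels, fitted, observed):
        print(f"level {t}: distance from fitted trend = {wasserstein2_1d(f, o):.3f}")
```

In this sketch, the per-level distance between the observed and fitted quantiles gives a crude measure of how much variation is not explained by a steady progression across levels.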

SUBMITTER: Mueller J 

PROVIDER: S-EPMC6428438 | biostudies-literature | 2018

REPOSITORIES: biostudies-literature

Publications

Modeling Persistent Trends in Distributions.

Jonas Mueller, Tommi Jaakkola, David Gifford

Journal of the American Statistical Association, 2018-06-12, issue 523


Similar Datasets

S-EPMC10498191 | biostudies-literature
S-EPMC6796490 | biostudies-literature
S-EPMC6157073 | biostudies-literature
S-EPMC4984523 | biostudies-literature
S-EPMC6114938 | biostudies-literature
S-EPMC7464021 | biostudies-literature
S-EPMC7050271 | biostudies-literature
S-EPMC5913757 | biostudies-literature
S-EPMC2605472 | biostudies-literature
S-EPMC5331158 | biostudies-literature