Dataset Information

Fast identification of differential distributions in single-cell RNA-sequencing data with waddR.


ABSTRACT: Single-cell gene expression distributions measured by single-cell RNA-sequencing (scRNA-seq) often display complex differences between samples. These differences are biologically meaningful but cannot be identified using standard methods for differential expression. Here, we derive and implement a flexible and fast differential distribution testing procedure based on the 2-Wasserstein distance. Our method is able to detect any type of difference in distribution between conditions. To interpret distributional differences, we decompose the 2-Wasserstein distance into terms that capture the relative contribution of changes in mean, variance and shape to the overall difference. Finally, we derive mathematical generalisations that allow our method to be used in a broad range of disciplines other than scRNA-seq or bioinformatics. Our methods are implemented in the R/Bioconductor package waddR, which is freely available at https://github.com/goncalves-lab/waddR, along with documentation and examples. Supplementary data are available at Bioinformatics online.
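
The decomposition mentioned in the abstract can be illustrated with a small sketch. It is not the waddR implementation (waddR is an R package) and it does not include the testing procedure. It assumes the standard quantile-based form of the squared 2-Wasserstein distance between two one-dimensional samples, and its standard split into a location term (difference in means), a size term (difference in standard deviations) and a shape term (whatever remains). The function name `squared_wasserstein_decomposition` and the `n_quantiles` parameter are hypothetical and chosen for this illustration only.

```python
import numpy as np


def squared_wasserstein_decomposition(x, y, n_quantiles=1000):
    """Approximate the squared 2-Wasserstein distance between two 1-D
    samples and split it into location, size and shape terms.

    Illustrative sketch only; names and defaults are assumptions,
    not the waddR API.
    """
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)

    # Evaluate both empirical quantile functions on a shared grid of
    # midpoints in (0, 1).
    u = (np.arange(n_quantiles) + 0.5) / n_quantiles
    qx = np.quantile(x, u)
    qy = np.quantile(y, u)

    # Squared 2-Wasserstein distance: mean squared gap between the
    # two quantile functions.
    d2 = np.mean((qx - qy) ** 2)

    # Means, standard deviations and covariance of the quantile
    # functions on the grid (population versions, ddof=0).
    mx, my = qx.mean(), qy.mean()
    sx, sy = qx.std(), qy.std()
    cov = np.mean((qx - mx) * (qy - my))

    location = (mx - my) ** 2      # change in mean
    size = (sx - sy) ** 2          # change in spread
    shape = 2.0 * (sx * sy - cov)  # remaining difference in shape

    return {"d2": d2, "location": location, "size": size, "shape": shape}


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    control = rng.normal(loc=0.0, scale=1.0, size=500)
    treated = rng.normal(loc=1.0, scale=2.0, size=500)
    print(squared_wasserstein_decomposition(control, treated))
```

On the quantile grid the three terms add up to the squared distance, so dividing each term by `d2` gives the relative contribution of mean, variance and shape that the abstract describes.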

SUBMITTER: Schefzik R 

PROVIDER: S-EPMC8504634 | biostudies-literature | 2021 Apr

REPOSITORIES: biostudies-literature

Publications

Fast identification of differential distributions in single-cell RNA-sequencing data with waddR.

Roman Schefzik, Julian Flesch, Angela Goncalves

Bioinformatics (Oxford, England), 2021-10-01, issue 19


Similar Datasets

S-EPMC7279618 | biostudies-literature
S-EPMC8693570 | biostudies-literature
S-EPMC10687889 | biostudies-literature
S-EPMC10463720 | biostudies-literature
S-EPMC6734286 | biostudies-literature
S-EPMC7727875 | biostudies-literature
S-EPMC10765963 | biostudies-literature
S-EPMC5737676 | biostudies-literature
S-EPMC10965033 | biostudies-literature
S-EPMC6339299 | biostudies-literature