Unknown

Dataset Information

0

Variance-adjusted Mahalanobis (VAM): a fast and accurate method for cell-specific gene set scoring.


ABSTRACT: Statistical analysis of single cell RNA-sequencing (scRNA-seq) data is hindered by high levels of technical noise and inflated zero counts. One promising approach for addressing these challenges is gene set testing, or pathway analysis, which can mitigate sparsity and noise, and improve interpretation and power, by aggregating expression data to the pathway level. Unfortunately, methods optimized for bulk transcriptomics perform poorly on scRNA-seq data and progress on single cell-specific techniques has been limited. Importantly, no existing methods support cell-level gene set inference. To address this challenge, we developed a new gene set testing method, Variance-adjusted Mahalanobis (VAM), that integrates with the Seurat framework and can accommodate the technical noise, sparsity and large sample sizes characteristic of scRNA-seq data. The VAM method computes cell-specific pathway scores to transform a cell-by-gene matrix into a cell-by-pathway matrix that can be used for both data visualization and statistical enrichment analysis. Because the distribution of these scores under the null of uncorrelated technical noise has an accurate gamma approximation, both population and cell-level inference is supported. As demonstrated using simulated and real scRNA-seq data, the VAM method provides superior classification accuracy at a lower computation cost relative to existing single sample gene set testing approaches.

SUBMITTER: Frost HR 

PROVIDER: S-EPMC7498348 | biostudies-literature | 2020 Sep

REPOSITORIES: biostudies-literature

altmetric image

Publications

Variance-adjusted Mahalanobis (VAM): a fast and accurate method for cell-specific gene set scoring.

Frost Hildreth Robert HR  

Nucleic acids research 20200901 16


Statistical analysis of single cell RNA-sequencing (scRNA-seq) data is hindered by high levels of technical noise and inflated zero counts. One promising approach for addressing these challenges is gene set testing, or pathway analysis, which can mitigate sparsity and noise, and improve interpretation and power, by aggregating expression data to the pathway level. Unfortunately, methods optimized for bulk transcriptomics perform poorly on scRNA-seq data and progress on single cell-specific techn  ...[more]

Similar Datasets

| S-EPMC3776447 | biostudies-literature
| S-EPMC7261819 | biostudies-literature
| S-EPMC2760878 | biostudies-other
| S-EPMC2896140 | biostudies-literature
| S-EPMC7310859 | biostudies-literature
| S-EPMC6280799 | biostudies-other
| S-EPMC6379139 | biostudies-literature
| S-EPMC2525522 | biostudies-literature
| S-EPMC4315301 | biostudies-literature
| S-EPMC4525044 | biostudies-other