Unknown

Dataset Information

0

Automatic identification of relevant genes from low-dimensional embeddings of single-cell RNA-seq data.


ABSTRACT: MOTIVATION:Dimensionality reduction is a key step in the analysis of single-cell RNA-sequencing data. It produces a low-dimensional embedding for visualization and as a calculation base for downstream analysis. Nonlinear techniques are most suitable to handle the intrinsic complexity of large, heterogeneous single-cell data. However, with no linear relation between gene and embedding coordinate, there is no way to extract the identity of genes driving any cell's position in the low-dimensional embedding, making it difficult to characterize the underlying biological processes. RESULTS:In this article, we introduce the concepts of local and global gene relevance to compute an equivalent of principal component analysis loadings for non-linear low-dimensional embeddings. Global gene relevance identifies drivers of the overall embedding, while local gene relevance identifies those of a defined sub-region. We apply our method to single-cell RNA-seq datasets from different experimental protocols and to different low-dimensional embedding techniques. This shows our method's versatility to identify key genes for a variety of biological processes. AVAILABILITY AND IMPLEMENTATION:To ensure reproducibility and ease of use, our method is released as part of destiny 3.0, a popular R package for building diffusion maps from single-cell transcriptomic data. It is readily available through Bioconductor. SUPPLEMENTARY INFORMATION:Supplementary data are available at Bioinformatics online.

SUBMITTER: Angerer P 

PROVIDER: S-EPMC7520047 | biostudies-literature | 2020 Aug

REPOSITORIES: biostudies-literature

altmetric image

Publications

Automatic identification of relevant genes from low-dimensional embeddings of single-cell RNA-seq data.

Angerer Philipp P   Fischer David S DS   Theis Fabian J FJ   Scialdone Antonio A   Marr Carsten C  

Bioinformatics (Oxford, England) 20200801 15


<h4>Motivation</h4>Dimensionality reduction is a key step in the analysis of single-cell RNA-sequencing data. It produces a low-dimensional embedding for visualization and as a calculation base for downstream analysis. Nonlinear techniques are most suitable to handle the intrinsic complexity of large, heterogeneous single-cell data. However, with no linear relation between gene and embedding coordinate, there is no way to extract the identity of genes driving any cell's position in the low-dimen  ...[more]

Similar Datasets

| S-EPMC7374962 | biostudies-literature
| S-EPMC4995760 | biostudies-literature
| S-EPMC4758103 | biostudies-literature
| S-EPMC6501316 | biostudies-literature
| S-EPMC6046202 | biostudies-literature
| S-EPMC10895345 | biostudies-literature
| S-EPMC6830085 | biostudies-literature
| S-EPMC8009088 | biostudies-literature
| S-EPMC6515904 | biostudies-literature
| S-EPMC5039928 | biostudies-literature