Unknown

Dataset Information

0

Spectral clustering of single-cell multi-omics data on multilayer graphs.


ABSTRACT:

Motivation

Single-cell sequencing technologies that simultaneously generate multimodal cellular profiles present opportunities for improved understanding of cell heterogeneity in tissues. How the multimodal information can be integrated to obtain a common cell type identification, however, poses a computational challenge. Multilayer graphs provide a natural representation of multi-omic single-cell sequencing datasets, and finding cell clusters may be understood as a multilayer graph partition problem.

Results

We introduce two spectral algorithms on multilayer graphs, spectral clustering on multilayer graphs and the weighted locally linear (WLL) method, to cluster cells in multi-omic single-cell sequencing datasets. We connect these algorithms through a unifying mathematical framework that represents each layer using a Hamiltonian operator and a mixture of its eigenstates to integrate the multiple graph layers, demonstrating in the process that the WLL method is a rigorous multilayer spectral graph theoretic reformulation of the popular Seurat weighted nearest neighbor (WNN) algorithm. Implementing our algorithms and applying them to a CITE-seq dataset of cord blood mononuclear cells yields results similar to the Seurat WNN analysis. Our work thus extends spectral methods to multimodal single-cell data analysis.

Availability and implementation

The code used in this study can be found at https://github.com/jssong-lab/sc-spectrum. All public data used in the article are accurately cited and described in Materials and Methods and in Supplementary Information.

Supplementary information

Supplementary data are available at Bioinformatics online.

SUBMITTER: Zhang S 

PROVIDER: S-EPMC9272806 | biostudies-literature | 2022 Jul

REPOSITORIES: biostudies-literature

altmetric image

Publications

Spectral clustering of single-cell multi-omics data on multilayer graphs.

Zhang Shuyi S   Leistico Jacob R JR   Cho Raymond J RJ   Cheng Jeffrey B JB   Song Jun S JS  

Bioinformatics (Oxford, England) 20220701 14


<h4>Motivation</h4>Single-cell sequencing technologies that simultaneously generate multimodal cellular profiles present opportunities for improved understanding of cell heterogeneity in tissues. How the multimodal information can be integrated to obtain a common cell type identification, however, poses a computational challenge. Multilayer graphs provide a natural representation of multi-omic single-cell sequencing datasets, and finding cell clusters may be understood as a multilayer graph part  ...[more]

Similar Datasets

| S-EPMC9805570 | biostudies-literature
| S-EPMC5991460 | biostudies-literature
| S-EPMC9710707 | biostudies-literature
| S-EPMC11015955 | biostudies-literature
| S-EPMC9748135 | biostudies-literature
| S-EPMC10288154 | biostudies-literature
| S-EPMC8597621 | biostudies-literature
| S-EPMC7703791 | biostudies-literature
| S-EPMC10329495 | biostudies-literature
| S-EPMC7293045 | biostudies-literature