Unknown

Dataset Information

0

Detecting heterogeneity in single-cell RNA-Seq data by non-negative matrix factorization.


ABSTRACT: Single-cell RNA-Sequencing (scRNA-Seq) is a fast-evolving technology that enables the understanding of biological processes at an unprecedentedly high resolution. However, well-suited bioinformatics tools to analyze the data generated from this new technology are still lacking. Here we investigate the performance of non-negative matrix factorization (NMF) method to analyze a wide variety of scRNA-Seq datasets, ranging from mouse hematopoietic stem cells to human glioblastoma data. In comparison to other unsupervised clustering methods including K-means and hierarchical clustering, NMF has higher accuracy in separating similar groups in various datasets. We ranked genes by their importance scores (D-scores) in separating these groups, and discovered that NMF uniquely identifies genes expressed at intermediate levels as top-ranked genes. Finally, we show that in conjugation with the modularity detection method FEM, NMF reveals meaningful protein-protein interaction modules. In summary, we propose that NMF is a desirable method to analyze heterogeneous single-cell RNA-Seq data. The NMF based subpopulation detection package is available at: https://github.com/lanagarmire/NMFEM.

SUBMITTER: Zhu X 

PROVIDER: S-EPMC5251935 | biostudies-literature | 2017

REPOSITORIES: biostudies-literature

altmetric image

Publications

Detecting heterogeneity in single-cell RNA-Seq data by non-negative matrix factorization.

Zhu Xun X   Ching Travers T   Pan Xinghua X   Weissman Sherman M SM   Garmire Lana L  

PeerJ 20170119


Single-cell RNA-Sequencing (scRNA-Seq) is a fast-evolving technology that enables the understanding of biological processes at an unprecedentedly high resolution. However, well-suited bioinformatics tools to analyze the data generated from this new technology are still lacking. Here we investigate the performance of non-negative matrix factorization (NMF) method to analyze a wide variety of scRNA-Seq datasets, ranging from mouse hematopoietic stem cells to human glioblastoma data. In comparison  ...[more]

Similar Datasets

| S-EPMC4562600 | biostudies-literature
| S-EPMC11338450 | biostudies-literature
| S-EPMC7671375 | biostudies-literature
| S-EPMC5006236 | biostudies-literature
| S-EPMC10292752 | biostudies-literature
| S-EPMC3931316 | biostudies-literature
| S-EPMC1434777 | biostudies-literature
| S-EPMC5746986 | biostudies-literature
| S-EPMC10690235 | biostudies-literature
| S-EPMC3479143 | biostudies-literature