Unknown

Dataset Information

0

ScAIDE: clustering of large-scale single-cell RNA-seq data reveals putative and rare cell types.


ABSTRACT: Recent advancements in both single-cell RNA-sequencing technology and computational resources facilitate the study of cell types on global populations. Up to millions of cells can now be sequenced in one experiment; thus, accurate and efficient computational methods are needed to provide clustering and post-analysis of assigning putative and rare cell types. Here, we present a novel unsupervised deep learning clustering framework that is robust and highly scalable. To overcome the high level of noise, scAIDE first incorporates an autoencoder-imputation network with a distance-preserved embedding network (AIDE) to learn a good representation of data, and then applies a random projection hashing based k-means algorithm to accommodate the detection of rare cell types. We analyzed a 1.3 million neural cell dataset within 30 min, obtaining 64 clusters which were mapped to 19 putative cell types. In particular, we further identified three different neural stem cell developmental trajectories in these clusters. We also classified two subpopulations of malignant cells in a small glioblastoma dataset using scAIDE. We anticipate that scAIDE would provide a more in-depth understanding of cell development and diseases.

SUBMITTER: Xie K 

PROVIDER: S-EPMC7671411 | biostudies-literature | 2020 Dec

REPOSITORIES: biostudies-literature

altmetric image

Publications

scAIDE: clustering of large-scale single-cell RNA-seq data reveals putative and rare cell types.

Xie Kaikun K   Huang Yu Y   Zeng Feng F   Liu Zehua Z   Chen Ting T  

NAR genomics and bioinformatics 20201009 4


Recent advancements in both single-cell RNA-sequencing technology and computational resources facilitate the study of cell types on global populations. Up to millions of cells can now be sequenced in one experiment; thus, accurate and efficient computational methods are needed to provide clustering and post-analysis of assigning putative and rare cell types. Here, we present a novel unsupervised deep learning clustering framework that is robust and highly scalable. To overcome the high level of  ...[more]

Similar Datasets

| S-EPMC6409843 | biostudies-other
| S-EPMC5037380 | biostudies-literature
| S-EPMC6477982 | biostudies-literature
| S-EPMC8644062 | biostudies-literature
| S-EPMC5410170 | biostudies-literature
| S-EPMC7549635 | biostudies-literature
| S-EPMC7437817 | biostudies-literature
| S-EPMC8015854 | biostudies-literature
| S-EPMC7798298 | biostudies-literature
| S-EPMC5845381 | biostudies-literature