Unknown

Dataset Information

0

Entropy subspace separation-based clustering for noise reduction (ENCORE) of scRNA-seq data.


ABSTRACT: Single-cell RNA sequencing enables us to characterize the cellular heterogeneity in single cell resolution with the help of cell type identification algorithms. However, the noise inherent in single-cell RNA-sequencing data severely disturbs the accuracy of cell clustering, marker identification and visualization. We propose that clustering based on feature density profiles can distinguish informative features from noise. We named such strategy as 'entropy subspace' separation and designed a cell clustering algorithm called ENtropy subspace separation-based Clustering for nOise REduction (ENCORE) by integrating the 'entropy subspace' separation strategy with a consensus clustering method. We demonstrate that ENCORE performs superiorly on cell clustering and generates high-resolution visualization across 12 standard datasets. More importantly, ENCORE enables identification of group markers with biological significance from a hard-to-separate dataset. With the advantages of effective feature selection, improved clustering, accurate marker identification and high-resolution visualization, we present ENCORE to the community as an important tool for scRNA-seq data analysis to study cellular heterogeneity and discover group markers.

SUBMITTER: Song J 

PROVIDER: S-EPMC7897472 | biostudies-literature | 2021 Feb

REPOSITORIES: biostudies-literature

altmetric image

Publications

Entropy subspace separation-based clustering for noise reduction (ENCORE) of scRNA-seq data.

Song Jia J   Liu Yao Y   Zhang Xuebing X   Wu Qiuyue Q   Gao Juan J   Wang Wei W   Li Jin J   Song Yanling Y   Yang Chaoyong C  

Nucleic acids research 20210201 3


Single-cell RNA sequencing enables us to characterize the cellular heterogeneity in single cell resolution with the help of cell type identification algorithms. However, the noise inherent in single-cell RNA-sequencing data severely disturbs the accuracy of cell clustering, marker identification and visualization. We propose that clustering based on feature density profiles can distinguish informative features from noise. We named such strategy as 'entropy subspace' separation and designed a cel  ...[more]

Similar Datasets

| S-EPMC8157426 | biostudies-literature
| S-EPMC5888655 | biostudies-literature
| S-EPMC8682753 | biostudies-literature
| S-EPMC8015842 | biostudies-literature
| S-EPMC8429846 | biostudies-literature
| S-EPMC8344557 | biostudies-literature
| S-EPMC7141853 | biostudies-literature
| S-EPMC8439043 | biostudies-literature
| S-EPMC6022691 | biostudies-other