Unknown

Dataset Information

0

Determination of essential phenotypic elements of clusters in high-dimensional entities-DEPECHE.


ABSTRACT: Technological advances have facilitated an exponential increase in the amount of information that can be derived from single cells, necessitating new computational tools that can make such highly complex data interpretable. Here, we introduce DEPECHE, a rapid, parameter free, sparse k-means-based algorithm for clustering of multi- and megavariate single-cell data. In a number of computational benchmarks aimed at evaluating the capacity to form biologically relevant clusters, including flow/mass-cytometry and single cell RNA sequencing data sets with manually curated gold standard solutions, DEPECHE clusters as well or better than the currently available best performing clustering algorithms. However, the main advantage of DEPECHE, compared to the state-of-the-art, is its unique ability to enhance interpretability of the formed clusters, in that it only retains variables relevant for cluster separation, thereby facilitating computational efficient analyses as well as understanding of complex datasets. DEPECHE is implemented in the open source R package DepecheR currently available at github.com/Theorell/DepecheR.

SUBMITTER: Theorell A 

PROVIDER: S-EPMC6405191 | biostudies-literature | 2019

REPOSITORIES: biostudies-literature

altmetric image

Publications

Determination of essential phenotypic elements of clusters in high-dimensional entities-DEPECHE.

Theorell Axel A   Bryceson Yenan Troi YT   Theorell Jakob J  

PloS one 20190307 3


Technological advances have facilitated an exponential increase in the amount of information that can be derived from single cells, necessitating new computational tools that can make such highly complex data interpretable. Here, we introduce DEPECHE, a rapid, parameter free, sparse k-means-based algorithm for clustering of multi- and megavariate single-cell data. In a number of computational benchmarks aimed at evaluating the capacity to form biologically relevant clusters, including flow/mass-  ...[more]

Similar Datasets

| S-EPMC8133874 | biostudies-literature
| S-EPMC3182942 | biostudies-literature
| S-EPMC3701810 | biostudies-other
| S-EPMC5738280 | biostudies-literature
| S-EPMC4815463 | biostudies-literature
| S-EPMC4076922 | biostudies-literature
| S-EPMC4082383 | biostudies-literature
| S-EPMC11017132 | biostudies-literature
| S-EPMC5627533 | biostudies-literature
| S-EPMC4046693 | biostudies-literature