Unknown

Dataset Information

0

Clustering: how much bias do we need?


ABSTRACT: Scientific investigations in medicine and beyond increasingly require observations to be described by more features than can be simultaneously visualized. Simply reducing the dimensionality by projections destroys essential relationships in the data. Similarly, traditional clustering algorithms introduce data bias that prevents detection of natural structures expected from generic nonlinear processes. We examine how these problems can best be addressed, where in particular we focus on two recent clustering approaches, Phenograph and Hebbian learning clustering, applied to synthetic and natural data examples. Our results reveal that already for very basic questions, minimizing clustering bias is essential, but that results can benefit further from biased post-processing.This article is part of the themed issue 'Mathematical methods in medicine: neuroscience, cardiology and pathology'.

SUBMITTER: Lorimer T 

PROVIDER: S-EPMC5434083 | biostudies-literature | 2017 Jun

REPOSITORIES: biostudies-literature

altmetric image

Publications

Clustering: how much bias do we need?

Lorimer Tom T   Held Jenny J   Stoop Ruedi R  

Philosophical transactions. Series A, Mathematical, physical, and engineering sciences 20170601 2096


Scientific investigations in medicine and beyond increasingly require observations to be described by more features than can be simultaneously visualized. Simply reducing the dimensionality by projections destroys essential relationships in the data. Similarly, traditional clustering algorithms introduce data bias that prevents detection of natural structures expected from generic nonlinear processes. We examine how these problems can best be addressed, where in particular we focus on two recent  ...[more]

Similar Datasets

| S-EPMC3341591 | biostudies-literature
| S-EPMC2879149 | biostudies-literature
| S-EPMC6808047 | biostudies-literature
2022-03-16 | PXD022124 | Pride
| S-EPMC9113673 | biostudies-literature
| S-EPMC4496034 | biostudies-literature
| S-EPMC6990430 | biostudies-literature
| S-EPMC4404656 | biostudies-literature
| S-EPMC4979233 | biostudies-other
| S-EPMC8164737 | biostudies-literature