Unknown

Dataset Information

0

Identifying homogeneous subgroups of patients and important features: a topological machine learning approach.


ABSTRACT:

Background

This paper exploits recent developments in topological data analysis to present a pipeline for clustering based on Mapper, an algorithm that reduces complex data into a one-dimensional graph.

Results

We present a pipeline to identify and summarise clusters based on statistically significant topological features from a point cloud using Mapper.

Conclusions

Key strengths of this pipeline include the integration of prior knowledge to inform the clustering process and the selection of optimal clusters; the use of the bootstrap to restrict the search to robust topological features; the use of machine learning to inspect clusters; and the ability to incorporate mixed data types. Our pipeline can be downloaded under the GNU GPLv3 license at https://github.com/kcl-bhi/mapper-pipeline .

SUBMITTER: Carr E 

PROVIDER: S-EPMC8451168 | biostudies-literature |

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC5600667 | biostudies-literature
| S-EPMC8149626 | biostudies-literature
| S-EPMC8259419 | biostudies-literature
| S-EPMC3235098 | biostudies-literature
| S-EPMC7403213 | biostudies-literature
2023-06-01 | GSE193400 | GEO
| S-EPMC7599600 | biostudies-literature
| S-EPMC7380860 | biostudies-literature
| S-EPMC8739611 | biostudies-literature
| S-EPMC7682196 | biostudies-literature