Dataset Information

Simple and Scalable Algorithms for Cluster-Aware Precision Medicine.


ABSTRACT: AI-enabled precision medicine promises a transformational improvement in healthcare outcomes. However, training on biomedical data presents significant challenges as they are often high dimensional, clustered, and of limited sample size. To overcome these challenges, we propose a simple and scalable approach for cluster-aware embedding that combines latent factor methods with a convex clustering penalty in a modular way. Our novel approach overcomes the complexity and limitations of current joint embedding and clustering methods and enables hierarchically clustered principal component analysis (PCA), locally linear embedding (LLE), and canonical correlation analysis (CCA). Through numerical experiments and real-world examples, we demonstrate that our approach outperforms fourteen clustering methods on highly underdetermined problems (e.g., with limited sample size) as well as on large sample datasets. Importantly, our approach does not require the user to choose the desired number of clusters, yields improved model selection if they do, and yields interpretable hierarchically clustered embedding dendrograms. Thus, our approach improves significantly on existing methods for identifying patient subgroups in multiomics and neuroimaging data and enables scalable and interpretable biomarkers for precision medicine.

SUBMITTER: Buch AM 

PROVIDER: S-EPMC11251711 | biostudies-literature | 2024 May

REPOSITORIES: biostudies-literature

Publications

Simple and Scalable Algorithms for Cluster-Aware Precision Medicine.

Buch AM, Liston C, Grosenick L

Proceedings of Machine Learning Research, 2024-05-01


Similar Datasets

| S-EPMC11893218 | biostudies-literature
| S-EPMC5923898 | biostudies-literature
| S-EPMC4830395 | biostudies-literature
| S-EPMC9940622 | biostudies-literature
| S-EPMC11748347 | biostudies-literature
| S-EPMC7900864 | biostudies-literature
| S-EPMC6321874 | biostudies-literature
| S-EPMC6072070 | biostudies-literature
| S-EPMC5937479 | biostudies-literature
2024-06-10 | MSV000094966 | MassIVE