Unknown

Dataset Information

0

Assessing reproducibility and utility of clustering of patients with type 2 diabetes and established CV disease (SAVOR -TIMI 53 trial).


ABSTRACT:

Objective

To assess the reproducibility and clinical utility of clustering-based subtyping of patients with type 2 diabetes (T2D) and established cardiovascular (CV) disease.

Methods

The cardiovascular outcome trial SAVOR-TIMI 53 (n = 16,492) was used. Analyses focused on T2D patients with established CV disease. Unsupervised machine learning technique called "k-means clustering" was used to divide patients into subtypes. K-means clustering including HbA1c, age of diagnosis, BMI, HOMA2-IR and HOMA2-B was used to assign clusters to the following diabetes subtypes: severe insulin deficient diabetes (SIDD); severe insulin-resistant diabetes (SIRD); mild obesity-related diabetes (MOD); mild age-related diabetes (MARD). We refer these subtypes as "clustering-based diabetes subtypes". A simulation study using randomly generated data was conducted to understand how correlations between the above variables influence the formation of the cluster-based diabetes subtypes. The predictive utility of clustering-based diabetes subtypes for CV events (3-point MACE), renal function reduction (eGFR decrease >30%) and diabetic disease progression (introduction of additional anti-diabetic medication) were compared with conventional risk scores. Hazard ratios (HR) were estimated by Cox-proportional hazard models.

Results

In the SAVOR-TIMI 53 trial based dataset, the percentage of the clustering-based T2D subtypes were; SIDD (18%), SIRD (17%), MOD (29%), MARD (37%). Using the simulated dataset, the diabetes subtypes could be largely reproduced from a log-normal distribution when including known correlations between variables. The predictive utility of clustering-based diabetic subtypes on CV events, renal function reduction, and diabetic disease progression did not show an advantage compared to conventional risk scores.

Conclusions

The consistent reproduction of four clustering-based T2D subtypes can be explained by the correlations between the variables used for clustering. Subtypes of T2D based on clustering had limited advantage compared to conventional risk scores to predict clinical outcome in patients with T2D and established CV disease.

SUBMITTER: Aoki Y 

PROVIDER: S-EPMC8604302 | biostudies-literature |

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC6594440 | biostudies-literature
| S-EPMC8252001 | biostudies-literature
| S-EPMC7717477 | biostudies-literature
| S-EPMC10339653 | biostudies-literature
| S-EPMC9032728 | biostudies-literature
| S-EPMC4488669 | biostudies-literature
| S-EPMC2732984 | biostudies-literature
| S-EPMC3730221 | biostudies-other
| S-EPMC10390619 | biostudies-literature
| S-EPMC5848819 | biostudies-literature