Unknown

Dataset Information

0

Applying Machine Learning Algorithms to Segment High-Cost Patient Populations.


ABSTRACT:

Background

Efforts to improve the value of care for high-cost patients may benefit from care management strategies targeted at clinically distinct subgroups of patients.

Objective

To evaluate the performance of three different machine learning algorithms for identifying subgroups of high-cost patients.

Design

We applied three different clustering algorithms-connectivity-based clustering using agglomerative hierarchical clustering, centroid-based clustering with the k-medoids algorithm, and density-based clustering with the OPTICS algorithm-to a clinical and administrative dataset. We then examined the extent to which each algorithm identified subgroups of patients that were (1) clinically distinct and (2) associated with meaningful differences in relevant utilization metrics.

Participants

Patients enrolled in a national Medicare Advantage plan, categorized in the top decile of spending (n?=?6154).

Main measures

Post hoc discriminative models comparing the importance of variables for distinguishing observations in one cluster from the rest. Variance in utilization and spending measures.

Key results

Connectivity-based, centroid-based, and density-based clustering identified eight, five, and ten subgroups of high-cost patients, respectively. Post hoc discriminative models indicated that density-based clustering subgroups were the most clinically distinct. The variance of utilization and spending measures was the greatest among the subgroups identified through density-based clustering.

Conclusions

Machine learning algorithms can be used to segment a high-cost patient population into subgroups of patients that are clinically distinct and associated with meaningful differences in utilization and spending measures. For these purposes, density-based clustering with the OPTICS algorithm outperformed connectivity-based and centroid-based clustering algorithms.

SUBMITTER: Yan J 

PROVIDER: S-EPMC6374273 | biostudies-literature | 2019 Feb

REPOSITORIES: biostudies-literature

altmetric image

Publications

Applying Machine Learning Algorithms to Segment High-Cost Patient Populations.

Yan Jiali J   Linn Kristin A KA   Powers Brian W BW   Zhu Jingsan J   Jain Sachin H SH   Kowalski Jennifer L JL   Navathe Amol S AS  

Journal of general internal medicine 20181212 2


<h4>Background</h4>Efforts to improve the value of care for high-cost patients may benefit from care management strategies targeted at clinically distinct subgroups of patients.<h4>Objective</h4>To evaluate the performance of three different machine learning algorithms for identifying subgroups of high-cost patients.<h4>Design</h4>We applied three different clustering algorithms-connectivity-based clustering using agglomerative hierarchical clustering, centroid-based clustering with the k-medoid  ...[more]

Similar Datasets

| S-EPMC6245495 | biostudies-other
| S-EPMC7013037 | biostudies-literature
| S-EPMC7304698 | biostudies-literature
| S-EPMC11366613 | biostudies-literature
| S-EPMC6907102 | biostudies-literature
2018-04-07 | GSE112798 | GEO
| S-EPMC7886120 | biostudies-literature
| S-EPMC7957118 | biostudies-literature
| S-EPMC9381914 | biostudies-literature
| S-EPMC9015194 | biostudies-literature