Dataset Information

Unsupervised Machine Learning Reveals Novel Traumatic Brain Injury Patient Phenotypes with Distinct Acute Injury Profiles and Long-Term Outcomes.

ABSTRACT: The heterogeneity of traumatic brain injury (TBI) remains a core challenge for the success of interventional clinical trials. Data-driven approaches for patient stratification may help to identify TBI patient phenotypes during the acute injury period as well as facilitate targeted trial patient enrollment and analysis of treatment efficacy. In this study, we implemented an unsupervised machine learning approach to identify TBI subpopulations at injury baseline using data from 1213 TBI patients who participated in the Citicoline Brain Injury Treatment Trial (COBRIT). A wrapper framework utilizing generalized low-rank models automatically selected relevant clinical features that were subsequently used to cluster patients using a partitioning around medoids clustering algorithm. Using this approach, we identified three patient phenotypes with unique clinical injury profiles based on a subset of acute injury features. Phenotype-specific differences in long-term functional outcome trajectories were observed at 3 and 6 months after injury. In comparison, when patients were grouped by baseline Glasgow Coma Scale (GCS), no differences in baseline clinical feature profiles or long-term outcomes were observed. To test phenotype reproducibility in an external validation data set, we used a K-nearest neighbors algorithm to classify subjects in the Transforming Research and Clinical Knowledge in Traumatic Brain Injury (TRACK-TBI) Pilot data set into corresponding phenotypes, then measured the Gower's dissimilarities between TRACK-TBI and COBRIT subjects in each phenotype. No significant differences were found between trial subjects within two phenotypes, suggesting that these phenotypes may be generalizable within a broad range of TBI severity. Further, Extended Glasgow Outcome Scale (GOS-E) outcomes in the TRACK-TBI data set similarly demonstrated phenotype-specific differences in long-term outcomes. Our results suggest that unsupervised machine learning is a promising and effective approach for discovery of novel injury subpopulations over the conventional GCS-based method, and may improve patient selection in future TBI clinical trials.
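The external validation step described in the abstract (assigning new subjects to phenotypes with K-nearest neighbors and comparing subjects with Gower's dissimilarity) can be illustrated with a minimal sketch. This is not the authors' code: the feature names, toy records, phenotype labels "A" and "B", the choice of k, and the use of Gower's dissimilarity as the nearest-neighbor distance are all assumptions made for illustration.

```python
from collections import Counter

def feature_ranges(records, kinds):
    """Observed range (max - min) of each numeric feature; 0.0 for categorical ones."""
    ranges = []
    for i, kind in enumerate(kinds):
        if kind == "num":
            values = [r[i] for r in records]
            ranges.append(float(max(values) - min(values)))
        else:
            ranges.append(0.0)
    return ranges

def gower(a, b, kinds, ranges):
    """Gower's dissimilarity between two mixed-type records, in [0, 1].

    Numeric features contribute |a - b| / range; categorical features
    contribute 0 if equal and 1 otherwise. The result is the mean over features.
    """
    total = 0.0
    for x, y, kind, rng in zip(a, b, kinds, ranges):
        if kind == "num":
            total += abs(x - y) / rng if rng else 0.0
        else:
            total += 0.0 if x == y else 1.0
    return total / len(kinds)

def knn_assign(subject, reference, labels, kinds, ranges, k=3):
    """Assign a subject to the majority phenotype among its k nearest reference records."""
    nearest = sorted(
        (gower(subject, ref, kinds, ranges), label)
        for ref, label in zip(reference, labels)
    )[:k]
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]

# Hypothetical toy data: (age in years, baseline GCS, loss of consciousness).
kinds = ("num", "num", "cat")
reference = [
    (25, 15, "no"), (30, 14, "no"), (28, 15, "no"),
    (60, 9, "yes"), (65, 8, "yes"), (58, 10, "yes"),
]
labels = ["A", "A", "A", "B", "B", "B"]  # invented phenotype labels
ranges = feature_ranges(reference, kinds)

new_subject = (27, 15, "no")
assigned = knn_assign(new_subject, reference, labels, kinds, ranges, k=3)
print(assigned)
```

Gower's dissimilarity suits this kind of data because it handles numeric and categorical clinical features on a common 0-to-1 scale, so no single feature dominates the distance because of its units.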

SUBMITTER: Folweiler KA

PROVIDER: S-EPMC7249479 | biostudies-literature | 2020 Jun

REPOSITORIES: biostudies-literature


Publications

Unsupervised Machine Learning Reveals Novel Traumatic Brain Injury Patient Phenotypes with Distinct Acute Injury Profiles and Long-Term Outcomes.

Folweiler KA, Sandsmark DK, Diaz-Arrastia R, Cohen AS, Masino AJ

Journal of Neurotrauma, 2020-03-11, issue 12


PMID: 32008422

Similar Datasets

Project description: Rationale: More targeted management of severe acute pediatric asthma could improve clinical outcomes. Objectives: To identify distinct clinical phenotypes of severe acute pediatric asthma using variables obtained in the first 12 h of hospitalization. Methods: We conducted a retrospective cohort study in a quaternary care children's hospital from 2014 to 2022. Encounters for children ages 2-18 years admitted to the hospital for asthma were included. We used consensus k-means clustering with patient demographics, vital signs, diagnostics, and laboratory data obtained in the first 12 h of hospitalization. Measurements and main results: The study population included 683 encounters divided into derivation (80%) and validation (20%) sets, and two distinct clusters were identified. Compared to Cluster 1 in the derivation set, Cluster 2 encounters (177 [32%]) were older (11 years [8; 14] vs. 5 years [3; 8]; p < .01) and more commonly males (63% vs. 53%; p = .03) of Black race (51% vs. 40%; p = .03) with non-Hispanic ethnicity (96% vs. 84%; p < .01). Cluster 2 encounters had smaller improvements in vital signs at 12 h, including percent change in heart rate (-1.7 [-11.7; 12.7] vs. -7.8 [-18.5; 1.7]; p < .01) and respiratory rate (0.0 [-20.0; 22.2] vs. -11.4 [-27.3; 9.0]; p < .01). Encounters in Cluster 2 had lower percentages of neutrophils (70.0 [55.0; 83.0] vs. 85.0 [77.0; 90.0]; p < .01) and higher percentages of lymphocytes (17.0 [8.0; 32.0] vs. 9.0 [5.3; 14.0]; p < .01). Cluster 2 encounters had higher rates of invasive mechanical ventilation (23% vs. 5%; p < .01), longer hospital length of stay (4.5 [2.6; 8.8] vs. 2.9 [2.0; 4.3]; p < .01), and a higher mortality rate (7.3% vs. 0.0%; p < .01). The predicted cluster assignments in the validation set shared the same ratio (~2:1) and many of the same characteristics. Conclusions: We identified two clinical phenotypes of severe acute pediatric asthma which exhibited distinct clinical features and outcomes.

Project description: Objective: Overlapping chronic pain syndromes, including fibromyalgia, are heterogeneous and often treatment-resistant entities carrying significant socioeconomic burdens. Individualized treatment approaches from both a somatic and psychological side are necessary to improve patient care. The objective of this study was to identify and visualize patient clusters in refractory musculoskeletal pain syndromes through an extensive set of clinical variables, including immunologic, psychosomatic, wearable, and sleep biomarkers. Methods: Data were collected during a multimodal pain program involving 202 patients. Seventy-eight percent of the patients fulfilled the criteria for fibromyalgia, 77% had a concomitant psychiatric-mediated disorder, and 22% a concomitant rheumatic immune-mediated disorder. Five patient phenotypes were identified by hierarchical agglomerative clustering as a form of unsupervised learning, and a predictive model for the Brief Pain Inventory (BPI) response was generated. Based on the clustering data, digital personas were created with DALL-E (OpenAI). Results: The most relevant distinguishing factors among clusters were living alone, body mass index, peripheral joint pain, alexithymia, psychiatric comorbidity, childhood pain, neuroleptic or benzodiazepine medication, and response to virtual reality. Having an immune-mediated disorder was not discriminatory. Three of five clusters responded to the multimodal treatment in terms of pain (BPI intensity), one cluster responded in terms of functional improvement (BPI interference), and one cluster notably responded to the virtual reality intervention. The independent predictive model confirmed strong opioids, trazodone, neuroleptic treatment, and living alone as the most important negative predictive factors for reduced pain after the program. Conclusion: Our model identified and visualized clinically relevant chronic musculoskeletal pain subtypes and predicted their response to multimodal treatment. Such digital personas and avatars may play a future role in the design of personalized therapeutic modalities and clinical trials.

Project description: This work aimed to identify pre-existing health conditions of patients with traumatic brain injury (TBI) and develop predictive models for the first TBI event and its external causes by employing a combination of unsupervised and supervised learning algorithms. We acquired up to five years of pre-injury diagnoses for 488,107 patients with TBI and 488,107 matched control patients who entered the emergency department or acute care hospitals between April 1st, 2002, and March 31st, 2020. Diagnoses were obtained from the Ontario Health Insurance Plan (OHIP) database, which contains province-wide claims data by physicians in Ontario, Canada for inpatient and outpatient services. A screening process was conducted on the OHIP diagnostic codes to limit the subsequent analysis to codes that were predictive of TBI; it found that 314 codes were significantly associated with TBI. The Latent Dirichlet Allocation (LDA) model was applied to the diagnostic codes and generated an optimal number of 19 topics that concur with published literature but also suggest other unexplored areas. Estimated word-topic probabilities from the LDA model helped us detect pre-morbid conditions among patients with TBI by uncovering the underlying patterns of diagnoses, while estimated document-topic probabilities were utilized in variable creation as a form of dimension reduction. We created 19 topic scores for each patient in the cohort, which were utilized along with socio-demographic factors for Random Forest binary classifier models. Test set performances evaluated using area under the receiver operating characteristic curve (AUC) were: TBI event (AUC = 0.85), external cause of injury: falls (AUC = 0.85), struck by/against (AUC = 0.83), cyclist collision (AUC = 0.76), motor vehicle collision (AUC = 0.83). Our analysis successfully demonstrated the feasibility of using machine learning to predict TBI due to various external causes and identified the most important factors that contribute to this prediction.
