Project description:
Background and purpose: Mechanical thrombectomy greatly improves stroke outcomes. Nonetheless, some patients fall short of full recovery despite good reperfusion. The purpose of this study was to develop machine learning (ML) models for the pre-interventional prediction of functional outcome 3 months after thrombectomy in acute ischemic stroke (AIS), using clinical and auto-extractable radiological information consistently available upon first emergency evaluation.
Materials and methods: A two-center retrospective cohort of 293 patients with AIS who underwent thrombectomy was analyzed. ML models were developed to predict the dichotomized modified Rankin Scale score at 90 days (mRS-90) using clinical and imaging features, both separately and combined. Conventional and experimental imaging biomarkers were quantified using automated image-processing software from non-contrast computed tomography (CT) and computed tomography angiography (CTA). Shapley Additive Explanations (SHAP) were applied for model interpretability and predictor importance analysis of the optimal model.
Results: Merging clinical and imaging features returned the best results for mRS-90 prediction. The best performing classifier was Extreme Gradient Boosting (XGB), with an area under the receiver operating characteristic curve (AUC) of 84% using selected features. The most important classifying features were age, baseline National Institutes of Health Stroke Scale (NIHSS), occlusion side, degree of brain atrophy [primarily represented by cortical cerebrospinal fluid (CSF) volume and lateral ventricle volume], early ischemic core [primarily represented by e-Alberta Stroke Program Early CT Score (ASPECTS)], and collateral circulation deficit volume on CTA.
Conclusion: Machine learning applied to quantifiable image features from CT and CTA, alongside basic clinical characteristics, constitutes a promising automated method for the pre-interventional prediction of stroke prognosis. Interpretable models allow for exploring which initial features contribute the most to post-thrombectomy outcome prediction, both overall and for each individual patient.
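The per-patient attribution idea behind SHAP can be illustrated with a minimal sketch that computes exact Shapley values by enumerating every feature coalition. This is not the study's XGB model or the SHAP library: `toy_risk`, its coefficients, the feature ordering, and the `baseline` reference values are all hypothetical and chosen only to make the arithmetic visible.

```python
from itertools import combinations
from math import factorial


def shapley_values(predict, x, baseline):
    """Exact Shapley values for one instance.

    Features outside a coalition are replaced by their baseline value.
    Cost grows as 2^n, so this is only practical for a handful of features.
    """
    n = len(x)
    features = range(n)

    def value(coalition):
        mixed = [x[i] if i in coalition else baseline[i] for i in features]
        return predict(mixed)

    phi = [0.0] * n
    for i in features:
        others = [j for j in features if j != i]
        for size in range(n):
            # Weight of a coalition of this size in the Shapley average.
            weight = factorial(size) * factorial(n - size - 1) / factorial(n)
            for subset in combinations(others, size):
                s = set(subset)
                phi[i] += weight * (value(s | {i}) - value(s))
    return phi


def toy_risk(features):
    """Hypothetical risk score (assumption, not the published model)."""
    age, nihss, atrophy = features
    return 0.02 * age + 0.05 * nihss + 0.001 * age * atrophy


patient = [80.0, 18.0, 30.0]   # hypothetical age, NIHSS, atrophy volume
baseline = [65.0, 10.0, 10.0]  # hypothetical reference patient
phi = shapley_values(toy_risk, patient, baseline)

for name, contribution in zip(["age", "NIHSS", "atrophy"], phi):
    print(f"{name}: {contribution:+.3f}")
```

The contributions sum to the difference between the patient's score and the baseline score, which is the property that lets such attributions be read as a per-patient breakdown of the prediction. Tree-specific SHAP implementations reach the same kind of attribution without enumerating all coalitions.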

Project description: Acute stroke is often superimposed on chronic damage from previous cerebrovascular events. This background will inevitably modulate the impact of acute injury on clinical outcomes to an extent that depends on the precise anatomical pattern of damage. Previous attempts to quantify such modulation have employed only reductive models that ignore anatomical detail. The combination of automated image processing, large-scale data, and machine learning now enables us to quantify this impact with high-dimensional multivariate models sensitive to individual variations in the detailed anatomical pattern. We introduce and validate a new automated chronic lesion segmentation routine for use with non-contrast CT brain scans, combining a non-parametric outlier-detection score, Zeta, with an unsupervised 3-dimensional maximum-flow, minimum-cut algorithm. The routine was then applied to a dataset of 1,704 stroke patient scans, obtained at presentation to a hyper-acute stroke unit (St George's Hospital, London, UK), and used to train a support vector machine (SVM) model to discriminate between low (0-2) and high (3-6) pre-admission and discharge modified Rankin Scale (mRS) scores, quantifying performance by the area under the receiver operating characteristic curve (AUROC). In this single-center retrospective observational study, our SVM models were able to differentiate between low (0-2) and high (3-6) pre-admission and discharge mRS scores with an AUROC of 0.77 (95% confidence interval 0.74-0.79) and 0.76 (0.74-0.78), respectively. The chronic lesion segmentation routine achieved a mean (standard deviation) sensitivity, specificity, and Dice similarity coefficient of 0.746 (0.069), 0.999 (0.001), and 0.717 (0.091), respectively. We have demonstrated that machine learning models capable of capturing the high-dimensional features of chronic injuries are able to stratify patients, at the time of presentation, by pre-admission and discharge mRS scores.
Our fully automated chronic stroke lesion segmentation routine simplifies this process, and utilizes routinely collected CT head scans, thereby facilitating future large-scale studies to develop supportive clinical decision tools.
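The three segmentation metrics reported above can be computed from voxel-wise agreement between a predicted mask and a reference mask. The following is a minimal sketch on flattened binary masks; the function name `segmentation_metrics` and the ten-voxel example are hypothetical, and the published routine may aggregate per scan differently.

```python
def segmentation_metrics(predicted, reference):
    """Sensitivity, specificity and Dice coefficient for two binary masks.

    Both masks are flat sequences of 0/1 values of equal length
    (for example, a 3-D volume flattened voxel by voxel).
    """
    if len(predicted) != len(reference):
        raise ValueError("masks must have the same number of voxels")

    tp = sum(1 for p, r in zip(predicted, reference) if p and r)
    fp = sum(1 for p, r in zip(predicted, reference) if p and not r)
    fn = sum(1 for p, r in zip(predicted, reference) if not p and r)
    tn = sum(1 for p, r in zip(predicted, reference) if not p and not r)

    # Undefined ratios (empty denominators) are reported as NaN.
    nan = float("nan")
    sensitivity = tp / (tp + fn) if (tp + fn) else nan
    specificity = tn / (tn + fp) if (tn + fp) else nan
    dice = 2 * tp / (2 * tp + fp + fn) if (2 * tp + fp + fn) else nan
    return sensitivity, specificity, dice


# Hypothetical ten-voxel example: 2 true positives, 1 false positive,
# 1 false negative, 6 true negatives.
predicted_mask = [1, 1, 1, 0, 0, 0, 0, 0, 0, 0]
reference_mask = [1, 1, 0, 1, 0, 0, 0, 0, 0, 0]
sens, spec, dice = segmentation_metrics(predicted_mask, reference_mask)
print(f"sensitivity={sens:.3f} specificity={spec:.3f} dice={dice:.3f}")
```

Specificity is computed over all non-lesion voxels, which typically dominate a brain volume, so values close to 1 are expected; the Dice coefficient ignores true negatives and is therefore the more informative of the three for small lesions.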

Project description:
Background: The prognosis, recurrence rates, and secondary prevention strategies vary significantly among different subtypes of acute ischemic stroke (AIS). Machine learning (ML) techniques can uncover intricate, non-linear relationships within medical data, enabling the identification of factors associated with etiological classification. However, there is currently a lack of research utilizing ML algorithms for predicting AIS etiology.
Objective: We aimed to use interpretable ML algorithms to develop AIS etiology prediction models, identify critical factors in etiology classification, and enhance existing clinical categorization.
Methods: This study involved patients from the Third China National Stroke Registry (CNSR-III). Nine models, namely Natural Gradient Boosting (NGBoost), Categorical Boosting (CatBoost), Extreme Gradient Boosting (XGBoost), Random Forest (RF), Light Gradient Boosting Machine (LGBM), Gradient Boosting Decision Tree (GBDT), Adaptive Boosting (AdaBoost), Support Vector Machine (SVM), and logistic regression (LR), were employed to predict large artery atherosclerosis (LAA), small vessel occlusion (SVO), and cardioembolism (CE) using an 80:20 random split into training and test sets. We designed an SFS-XGB with 10-fold cross-validation for feature selection. The primary evaluation metrics for the models were the area under the receiver operating characteristic curve (AUC) for discrimination and the Brier score (or calibration plots) for calibration.
Results: A total of 5,213 patients were included, comprising 2,471 (47.4%) with LAA, 2,153 (41.3%) with SVO, and 589 (11.3%) with CE. In both the LAA and SVO models, the AUC values of the ML models were significantly higher than that of the LR model (P < 0.001). The optimal model for predicting SVO (AUC [RF model] = 0.932) outperformed the optimal LAA model (AUC [NGB model] = 0.917) and the optimal CE model (AUC [LGBM model] = 0.846). Each model displayed relatively satisfactory calibration. Further analysis showed that the optimal CE model could identify potential CE patients in the undetermined etiology (SUE) group: 1,900 of 4,156 patients (45.7%).
Conclusions: The ML algorithms effectively classified patients with LAA, SVO, and CE, demonstrating superior classification performance compared to the LR model. The optimal ML model can identify potential CE patients among SUE patients. These newly identified predictive factors may complement the existing etiological classification system, enabling clinicians to promptly categorize stroke patients' etiology and initiate optimal strategies for secondary prevention.
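The two evaluation metrics named in the methods can be sketched in a few lines. The example below assumes a one-vs-rest setup (CE versus the other subtypes), which the abstract does not state explicitly; the subtype labels and predicted probabilities are made-up values for illustration, not registry data.

```python
def brier_score(y_true, y_prob):
    """Mean squared difference between predicted probability and outcome (0/1).

    Lower is better; 0 means perfectly confident and correct predictions.
    """
    return sum((p - y) ** 2 for y, p in zip(y_true, y_prob)) / len(y_true)


def auc(y_true, y_score):
    """AUC as the probability that a random positive case scores higher
    than a random negative case, counting ties as one half."""
    positives = [s for y, s in zip(y_true, y_score) if y == 1]
    negatives = [s for y, s in zip(y_true, y_score) if y == 0]
    if not positives or not negatives:
        raise ValueError("AUC needs at least one positive and one negative")
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in positives
        for n in negatives
    )
    return wins / (len(positives) * len(negatives))


# Hypothetical subtype labels and predicted CE probabilities.
subtypes = ["LAA", "SVO", "CE", "LAA", "SVO", "CE"]
ce_probability = [0.1, 0.5, 0.8, 0.3, 0.1, 0.4]

# One-vs-rest target: 1 for CE, 0 for every other subtype (assumption).
is_ce = [1 if s == "CE" else 0 for s in subtypes]

ce_auc = auc(is_ce, ce_probability)
ce_brier = brier_score(is_ce, ce_probability)
print(f"AUC={ce_auc:.3f} Brier={ce_brier:.3f}")
```

AUC depends only on how the cases are ranked, while the Brier score also penalizes probabilities that are far from the observed outcome, which is why the two are reported together for discrimination and calibration.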

Dataset Information

Interpretable machine learning for prediction of clinical outcomes in acute ischemic stroke
