Dataset Information

Machine learning to predict rapid progression of carotid atherosclerosis in patients with impaired glucose tolerance.

ABSTRACT:

Objectives

Prediabetes is a major epidemic and is associated with adverse cardio-cerebrovascular outcomes. Early identification of patients who will develop rapid progression of atherosclerosis could be beneficial for improved risk stratification. In this paper, we investigate important factors impacting the prediction, using several machine learning methods, of rapid progression of carotid intima-media thickness in impaired glucose tolerance (IGT) participants.

Methods

In the Actos Now for Prevention of Diabetes (ACT NOW) study, 382 participants with IGT underwent carotid intima-media thickness (CIMT) ultrasound evaluation at baseline and at 15-18 months, and were divided into rapid progressors (RP, n = 39, 58 ± 17.5 μM change) and non-rapid progressors (NRP, n = 343, 5.8 ± 20 μM change, p < 0.001 versus RP). To deal with complex multi-modal data consisting of demographic, clinical, and laboratory variables, we propose a general data-driven framework to investigate the ACT NOW dataset. In particular, we first employed a Fisher Score-based feature selection method to identify the most effective variables and then proposed a probabilistic Bayes-based learning method for the prediction. Comparison of the methods and factors was conducted using area under the receiver operating characteristic curve (AUC) analyses and Brier score.

Results

The experimental results show that the proposed learning methods performed well in identifying or predicting RP. Among the methods, the performance of Naïve Bayes was the best (AUC 0.797, Brier score 0.085) compared to multilayer perceptron (0.729, 0.086) and random forest (0.642, 0.10). The results also show that feature selection has a significant positive impact on the data prediction performance.

Conclusions

By dealing with multi-modal data, the proposed learning methods show effectiveness in predicting prediabetics at risk for rapid atherosclerosis progression. The proposed framework demonstrated utility in outcome prediction in a typical multidimensional clinical dataset with a relatively small number of subjects, extending the potential utility of machine learning approaches beyond extremely large-scale datasets.

SUBMITTER: Hu X

PROVIDER: S-EPMC5011483 | biostudies-literature | 2016 Dec

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

Machine learning to predict rapid progression of carotid atherosclerosis in patients with impaired glucose tolerance.

Hu Xia X Reaven Peter D PD Saremi Aramesh A Liu Ninghao N Abbasi Mohammad Ali MA Liu Huan H Migrino Raymond Q RQ

EURASIP journal on bioinformatics & systems biology 20160905 1

<h4>Objectives</h4>Prediabetes is a major epidemic and is associated with adverse cardio-cerebrovascular outcomes. Early identification of patients who will develop rapid progression of atherosclerosis could be beneficial for improved risk stratification. In this paper, we investigate important factors impacting the prediction, using several machine learning methods, of rapid progression of carotid intima-media thickness in impaired glucose tolerance (IGT) participants.<h4>Methods</h4>In the Act ...[more]

PMID: 27642290

Similar Datasets

Project description:AimsTo define endotypes of carotid subclinical atherosclerosis.Methods and resultsWe integrated demographic, clinical, and molecular data (n = 124) with ultrasonographic carotid measurements from study participants in the IMPROVE cohort (n = 3340). We applied a neural network algorithm and hierarchical clustering to identify carotid atherosclerosis endotypes. A measure of carotid subclinical atherosclerosis, the c-IMTmean-max, was used to extract atherosclerosis-related features and SHapley Additive exPlanations (SHAP) to reveal endotypes. The association of endotypes with carotid ultrasonographic measurements at baseline, after 30 months, and with the 3-year atherosclerotic cardiovascular disease (ASCVD) risk was estimated by linear (β, SE) and Cox [hazard ratio (HR), 95% confidence interval (CI)] regression models. Crude estimates were adjusted by common cardiovascular risk factors, and baseline ultrasonographic measures. Improvement in ASCVD risk prediction was evaluated by C-statistic and by net reclassification improvement with reference to SCORE2, c-IMTmean-max, and presence of carotid plaques. An ensemble stacking model was used to predict endotypes in an independent validation cohort, the PIVUS (n = 1061). We identified four endotypes able to differentiate carotid atherosclerosis risk profiles from mild (endotype 1) to severe (endotype 4). SHAP identified endotype-shared variables (age, biological sex, and systolic blood pressure) and endotype-specific biomarkers. In the IMPROVE, as compared to endotype 1, endotype 4 associated with the thickest c-IMT at baseline (β, SE) 0.36 (0.014), the highest number of plaques 1.65 (0.075), the fastest c-IMT progression 0.06 (0.013), and the highest ASCVD risk (HR, 95% CI) (1.95, 1.18-3.23). Baseline and progression measures of carotid subclinical atherosclerosis and ASCVD risk were associated with the predicted endotypes in the PIVUS. Endotypes consistently improved measures of ASCVD risk discrimination and reclassification in both study populations.ConclusionsWe report four replicable subclinical carotid atherosclerosis-endotypes associated with progression of atherosclerosis and ASCVD risk in two independent populations. Our approach based on endotypes can be applied for precision medicine in ASCVD prevention.

Project description:Background Rapid coronary plaque progression (RPP) is associated with incident cardiovascular events. To date, no method exists for the identification of individuals at risk of RPP at a single point in time. This study integrated coronary computed tomography angiography-determined qualitative and quantitative plaque features within a machine learning (ML) framework to determine its performance for predicting RPP. Methods and Results Qualitative and quantitative coronary computed tomography angiography plaque characterization was performed in 1083 patients who underwent serial coronary computed tomography angiography from the PARADIGM (Progression of Atherosclerotic Plaque Determined by Computed Tomographic Angiography Imaging) registry. RPP was defined as an annual progression of percentage atheroma volume ≥1.0%. We employed the following ML models: model 1, clinical variables; model 2, model 1 plus qualitative plaque features; model 3, model 2 plus quantitative plaque features. ML models were compared with the atherosclerotic cardiovascular disease risk score, Duke coronary artery disease score, and a logistic regression statistical model. 224 patients (21%) were identified as RPP. Feature selection in ML identifies that quantitative computed tomography variables were higher-ranking features, followed by qualitative computed tomography variables and clinical/laboratory variables. ML model 3 exhibited the highest discriminatory performance to identify individuals who would experience RPP when compared with atherosclerotic cardiovascular disease risk score, the other ML models, and the statistical model (area under the receiver operating characteristic curve in ML model 3, 0.83 [95% CI 0.78-0.89], versus atherosclerotic cardiovascular disease risk score, 0.60 [0.52-0.67]; Duke coronary artery disease score, 0.74 [0.68-0.79]; ML model 1, 0.62 [0.55-0.69]; ML model 2, 0.73 [0.67-0.80]; all P<0.001; statistical model, 0.81 [0.75-0.87], P=0.128). Conclusions Based on a ML framework, quantitative atherosclerosis characterization has been shown to be the most important feature when compared with clinical, laboratory, and qualitative measures in identifying patients at risk of RPP.

Project description:BackgroundInternal carotid artery stenosis (ICAS) can cause stroke and cognitive decline. Associated hemodynamic impairments, which are most pronounced within individual watershed areas (iWSA) between vascular territories, can be assessed with hemodynamic-oxygenation-sensitive MRI and may help to detect severely affected patients. We aimed to identify the most sensitive parameters and volumes of interest (VOI) to predict high-grade ICAS with random forest machine learning. We hypothesized an increased predictive ability considering iWSAs and a decreased cognitive performance in correctly classified patients.Materials and methodsTwenty-four patients with asymptomatic, unilateral, high-grade carotid artery stenosis and 24 age-matched healthy controls underwent MRI comprising pseudo-continuous arterial spin labeling (pCASL), breath-holding functional MRI (BH-fMRI), dynamic susceptibility contrast (DSC), T2 and T2* mapping, MPRAGE and FLAIR. Quantitative maps of eight perfusion, oxygenation and microvascular parameters were obtained. Mean values of respective parameters within and outside of iWSAs split into gray (GM) and white matter (WM) were calculated for both hemispheres and for interhemispheric differences resulting in 96 features. Random forest classifiers were trained on whole GM/WM VOIs, VOIs considering iWSAs and with additional feature selection, respectively.ResultsThe most sensitive features in decreasing order were time-to-peak (TTP), cerebral blood flow (CBF) and cerebral vascular reactivity (CVR), all of these inside of iWSAs. Applying iWSAs combined with feature selection yielded significantly higher receiver operating characteristics areas under the curve (AUC) than whole GM/WM VOIs (AUC: 0.84 vs. 0.90, p = 0.039). Correctly predicted patients presented with worse cognitive performances than frequently misclassified patients (Trail-making-test B: 152.5s vs. 94.4s, p = 0.034).ConclusionRandom forest classifiers trained on multiparametric MRI data allow identification of the most relevant parameters and VOIs to predict ICAS, which may improve personalized treatments.

Dataset Information

Machine learning to predict rapid progression of carotid atherosclerosis in patients with impaired glucose tolerance.

Objectives

Methods

Results

Conclusions

Publications

Machine learning to predict rapid progression of carotid atherosclerosis in patients with impaired glucose tolerance.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets