Dataset Information

Preoperative Prediction of Lymph Node Metastasis in Patients With Early-T-Stage Non-small Cell Lung Cancer by Machine Learning Algorithms.

ABSTRACT: Background: Lymph node metastasis (LNM) is difficult to precisely predict before surgery in patients with early-T-stage non-small cell lung cancer (NSCLC). This study aimed to develop machine learning (ML)-based predictive models for LNM. Methods: Clinical characteristics and imaging features were retrospectively collected from 1,102 NSCLC ? 2 cm patients. A total of 23 variables were included to develop predictive models for LNM by multiple ML algorithms. The models were evaluated by the receiver operating characteristic (ROC) curve for predictive performance and decision curve analysis (DCA) for clinical values. A feature selection approach was used to identify optimal predictive factors. Results: The areas under the ROC curve (AUCs) of the 8 models ranged from 0.784 to 0.899. Some ML-based models performed better than models using conventional statistical methods in both ROC curves and decision curves. The random forest classifier (RFC) model with 9 variables introduced was identified as the best predictive model. The feature selection indicated the top five predictors were tumor size, imaging density, carcinoembryonic antigen (CEA), maximal standardized uptake value (SUVmax), and age. Conclusions: By incorporating clinical characteristics and radiographical features, it is feasible to develop ML-based models for the preoperative prediction of LNM in early-T-stage NSCLC, and the RFC model performed best.

SUBMITTER: Wu Y

PROVIDER: S-EPMC7237747 | biostudies-literature | 2020

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

Preoperative Prediction of Lymph Node Metastasis in Patients With Early-T-Stage Non-small Cell Lung Cancer by Machine Learning Algorithms.

Wu Yijun Y Liu Jianghao J Han Chang C Liu Xinyu X Chong Yuming Y Wang Zhile Z Gong Liang L Zhang Jiaqi J Gao Xuehan X Guo Chao C Liang Naixin N Li Shanqing S

Frontiers in oncology 20200513

<b>Background:</b> Lymph node metastasis (LNM) is difficult to precisely predict before surgery in patients with early-T-stage non-small cell lung cancer (NSCLC). This study aimed to develop machine learning (ML)-based predictive models for LNM. <b>Methods:</b> Clinical characteristics and imaging features were retrospectively collected from 1,102 NSCLC ≤ 2 cm patients. A total of 23 variables were included to develop predictive models for LNM by multiple ML algorithms. The models were evaluated ...[more]

PMID: 32477952

Similar Datasets

Project description:Lymph node involvement increases the risk of breast cancer recurrence. An accurate non-invasive assessment of nodal involvement is valuable in cancer staging, surgical risk, and cost savings. Radiomics has been proposed to pre-operatively predict sentinel lymph node (SLN) status; however, radiomic models are known to be sensitive to acquisition parameters. The purpose of this study was to develop a prediction model for preoperative prediction of SLN metastasis using deep learning-based (DLB) features and compare its predictive performance to state-of-the-art radiomics. Specifically, this study aimed to compare the generalizability of radiomics vs DLB features in an independent test set with dissimilar resolution. Dynamic contrast-enhancement images from 198 patients (67 positive SLNs) were used in this study. Of these subjects, 163 had an in-plane resolution of 0.7 × 0.7 mm2, which were randomly divided into a training set (approximately 67%) and a validation set (approximately 33%). The remaining 35 subjects with a different in-plane resolution (0.78 × 0.78 mm2) were treated as independent testing set for generalizability. Two methods were employed: (1) conventional radiomics (CR), and (2) DLB features which replaced hand-curated features with pre-trained VGG-16 features. The threshold determined using the training set was applied to the independent validation and testing dataset. Same feature reduction, feature selection, model creation procedures were used for both approaches. In the validation set (same resolution as training), the DLB model outperformed the CR model (accuracy 83% vs 80%). Furthermore, in the independent testing set of the dissimilar resolution, the DLB model performed markedly better than the CR model (accuracy 77% vs 71%). The predictive performance of the DLB model outperformed the CR model for this task. More interestingly, these improvements were seen particularly in the independent testing set of dissimilar resolution. This could indicate that DLB features can ultimately result in a more generalizable model.

Project description:ObjectivesThis research aimed to assess the value of radiomics combined with multiple machine learning algorithms in the diagnosis of pancreatic ductal adenocarcinoma (PDAC) lymph node (LN) metastasis, which is expected to provide clinical treatment strategies.MethodsA total of 128 patients with pathologically confirmed PDAC and who underwent surgical resection were randomized into training (n=93) and validation (n=35) groups. This study incorporated a total of 13 distinct machine learning algorithms and explored 85 unique combinations of these algorithms. The area under the curve (AUC) of each model was computed. The model with the highest mean AUC was selected as the best model which was selected to determine the radiomics score (Radscore). The clinical factors were examined by the univariate and multivariate analysis, which allowed for the identification of factors suitable for clinical modeling. The multivariate logistic regression was used to create a combined model using Radscore and clinical variables. The diagnostic performance was assessed by receiver operating characteristic curves, calibration curves, and decision curve analysis (DCA).ResultsAmong the 233 models constructed using arterial phase (AP), venous phase (VP), and AP+VP radiomics features, the model built by applying AP+VP radiomics features and a combination of Lasso+Logistic algorithm had the highest mean AUC. A clinical model was eventually constructed using CA199 and tumor size. The combined model consisted of AP+VP-Radscore and two clinical factors that showed the best diagnostic efficiency in the training (AUC = 0.920) and validation (AUC = 0.866) cohorts. Regarding preoperative diagnosis of LN metastasis, the calibration curve and DCA demonstrated that the combined model had a good consistency and greatest net benefit.ConclusionsCombining radiomics and machine learning algorithms demonstrated the potential for identifying the LN metastasis of PDAC. As a non-invasive and efficient preoperative prediction tool, it can be beneficial for decision-making in clinical practice.

Project description:BackgroundAt present, preoperative diagnosis of lateral cervical lymph node metastasis (LLNM) in patients with papillary thyroid carcinoma (PTC) mostly depends on the training and expertise of ultrasound doctors. A machine-learning model for predicting LLNM accurately before PTC surgery may help to determine the scope of surgery and reduce unnecessary surgical trauma.MethodsThe data of patients with primary PTC who underwent thyroidectomy with lateral cervical lymph node surgery at Beijing Tongren Hospital between July 2009 and June 2021 were retrospectively analyzed. All patients had complete ultrasonic examination, clinical data, and definite pathology diagnosis of lymph nodes. LLNM was confirmed by postoperative pathology. The patients were randomly divided into a training set (155 cases) and a test set (98 cases) at a ratio of 6:4. Eleven parameters, including patient demographics, ultrasound results, and tumor-related conditions, were collected, and a prediction model was established using the support vector machine (SVM) algorithm. Several other machine-learning algorithms were also used to establish models for comparison. The accuracy, precision, recall, F1-score, sensitivity, specificity, Cohen's kappa value, and area under the receiver operating characteristic curve (AUC) were used to evaluate model performance.ResultsA total of 87 males and 156 females were included in the study, aged 14-80 years. One hundred and four patients of them had LLNM and 139 did not have LLNM. The pandas Python library was used for the statistical analysis, and the Spearman coefficient was used to analyze the correlation between each parameter and the prediction index. The SVM model performed the best among all the models. Its accuracy, precision, recall, F1-score, sensitivity, specificity, Cohen's kappa value, and AUC were 90.8%, 91.0%, 90.8%, 90.8%, 87.5%, 94.0%, 81.6%, and 91.0%, respectively.ConclusionsThis model can enable surgeons to improve the accuracy of ultrasonography in predicting LLNM without additional examination, thus avoiding missing positive lateral cervical lymph nodes and reducing the sequelae caused by unnecessary lateral neck dissection.

Project description:BackgroundGastric cancer, a pervasive malignancy globally, often presents with regional lymph node metastasis (LNM), profoundly impacting prognosis and treatment options. Existing clinical methods for determining the presence of LNM are not precise enough, necessitating the development of an accurate risk prediction model.ObjectiveOur primary objective was to employ machine learning algorithms to identify risk factors for LNM and establish a precise prediction model for stage II-III gastric cancer.MethodsA study was conducted at Renji Hospital Affiliated to Shanghai Jiao Tong University School of Medicine between May 2010 and December 2022. This retrospective study analyzed 1147 surgeries for gastric cancer and explored the clinicopathological differences between LNM and non-LNM cohorts. Utilizing univariate logistic regression and two machine learning methodologies-Least absolute shrinkage and selection operator (LASSO) and random forest (RF)-we identified vascular invasion, maximum tumor diameter, percentage of monocytes, hematocrit (HCT), and lymphocyte-monocyte ratio (LMR) as salient factors and consolidated them into a nomogram model. The area under the receiver operating characteristic (ROC) curve (AUC), calibration curves, and decision curves were used to evaluate the test efficacy of the nomogram. Shapley Additive Explanation (SHAP) values were utilized to illustrate the predictive impact of each feature on the model's output.ResultsSignificant differences in tumor characteristics were discerned between LNM and non-LNM cohorts through appropriate statistical methods. A nomogram, incorporating vascular invasion, maximum tumor diameter, percentage of monocytes, HCT, and LMR, was developed and exhibited satisfactory predictive capabilities with an AUC of 0.787 (95% CI: 0.749-0.824) in the training set and 0.753 (95% CI: 0.694-0.812) in the validation set. Calibration curves and decision curves affirmed the nomogram's predictive accuracy.ConclusionIn conclusion, leveraging machine learning algorithms, we devised a nomogram for precise LNM risk prognostication in stage II-III gastric cancer, offering a valuable tool for tailored risk assessment in clinical decision-making.

Dataset Information

Preoperative Prediction of Lymph Node Metastasis in Patients With Early-T-Stage Non-small Cell Lung Cancer by Machine Learning Algorithms.

Publications

Preoperative Prediction of Lymph Node Metastasis in Patients With Early-T-Stage Non-small Cell Lung Cancer by Machine Learning Algorithms.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets