Dataset Information

Development and Application of a Machine Learning Approach to Assess Short-term Mortality Risk Among Patients With Cancer Starting Chemotherapy.

ABSTRACT:

Importance: Patients with cancer who die soon after starting chemotherapy incur costs of treatment without the benefits. Accurately predicting mortality risk before administering chemotherapy is important, but few patient data-driven tools exist.

Objective: To create and validate a machine learning model that predicts mortality in a general oncology cohort starting new chemotherapy, using only data available before the first day of treatment.

Design, Setting, and Participants: This retrospective cohort study of patients at a large academic cancer center from January 1, 2004, through December 31, 2014, determined date of death by linkage to Social Security data. The model was derived using data from 2004 through 2011, and performance was measured on nonoverlapping data from 2012 through 2014. The analysis was conducted from June 1 through August 1, 2017. Participants included 26 946 patients starting 51 774 new chemotherapy regimens.

Main Outcomes and Measures: Thirty-day mortality from the first day of a new chemotherapy regimen. Secondary outcomes included model discrimination by predicted mortality risk decile among patients receiving palliative chemotherapy, and 180-day mortality from the first day of a new chemotherapy regimen.

Results: Among the 26 946 patients included in the analysis, mean age was 58.7 years (95% CI, 58.5-58.9 years); 61.1% were female (95% CI, 60.4%-61.9%); and 86.9% were white (95% CI, 86.4%-87.4%). Thirty-day mortality from chemotherapy start was 2.1% (95% CI, 1.9%-2.4%). Among the 9114 patients in the validation set, the most common primary cancers were breast (21.1%; 95% CI, 20.2%-21.9%), colorectal (19.3%; 95% CI, 18.5%-20.2%), and lung (18.0%; 95% CI, 17.2%-18.8%). Model predictions were accurate for all patients (area under the curve [AUC], 0.940; 95% CI, 0.930-0.951). Predictions for patients starting palliative chemotherapy (46.6% of regimens; 95% CI, 45.8%-47.3%), for whom prognosis is particularly important, remained highly accurate (AUC, 0.924; 95% CI, 0.910-0.939). To illustrate model discrimination, patients initiating palliative chemotherapy were ranked by model-predicted mortality risk, and observed mortality was calculated by risk decile. Thirty-day mortality in the highest-risk decile was 22.6% (95% CI, 19.6%-25.6%); in the lowest-risk decile, no patients died. Predictions remained accurate across all primary cancers, stages, and chemotherapies, even for clinical trial regimens that first appeared in years after the model was trained (AUC, 0.942; 95% CI, 0.882-1.000). The same model also performed well for prediction of 180-day mortality (AUC for all patients, 0.870 [95% CI, 0.862-0.877]; highest- vs lowest-risk decile mortality, 74.8% [95% CI, 72.7%-77.0%] vs 0.2% [95% CI, 0.01%-0.4%]). Predictions were more accurate than estimates from randomized clinical trials of individual chemotherapies or the Surveillance, Epidemiology, and End Results data set.

Conclusions and Relevance: A machine learning algorithm using electronic health record data accurately predicted short-term mortality among patients starting chemotherapy. Further research is necessary to determine the generalizability and feasibility of applying this algorithm in clinical settings.
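The two discrimination measures reported in the abstract, the AUC and observed mortality by predicted-risk decile, can be illustrated with a minimal Python sketch. This is not the authors' code: the function names `auc` and `mortality_by_decile` and the toy `scores` and `died` values are hypothetical and chosen only for illustration. The AUC is computed from its rank interpretation (the probability that a patient who died received a higher predicted risk than a patient who survived), and the decile table is built by sorting patients by predicted risk and cutting the ranking into ten equal-sized groups.

```python
from typing import List, Sequence


def auc(scores: Sequence[float], died: Sequence[int]) -> float:
    """AUC as the probability that a randomly chosen death is scored
    higher than a randomly chosen survivor (ties count as one half)."""
    pos = [s for s, d in zip(scores, died) if d == 1]
    neg = [s for s, d in zip(scores, died) if d == 0]
    if not pos or not neg:
        raise ValueError("need at least one death and one survivor")
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def mortality_by_decile(scores: Sequence[float], died: Sequence[int],
                        bins: int = 10) -> List[float]:
    """Rank patients by predicted risk (lowest first), split the ranking
    into `bins` equal-sized groups, and return observed mortality per group."""
    ranked = sorted(zip(scores, died), key=lambda pair: pair[0])
    n = len(ranked)
    rates = []
    for b in range(bins):
        chunk = ranked[b * n // bins:(b + 1) * n // bins]
        deaths = sum(d for _, d in chunk)
        rates.append(deaths / len(chunk) if chunk else float("nan"))
    return rates


# Hypothetical toy data: 20 patients with increasing predicted risk,
# two of whom died (the 18th and the 20th in risk order).
scores = [i / 20 for i in range(20)]
died = [0] * 17 + [1, 0, 1]

toy_auc = auc(scores, died)
toy_deciles = mortality_by_decile(scores, died)
print(f"AUC: {toy_auc:.3f}")
print("Observed mortality by risk decile:", toy_deciles)
```

In this toy example all deaths fall in the two highest-risk groups, which is the pattern the abstract describes when it contrasts the highest- and lowest-risk deciles. The pairwise AUC loop is quadratic in cohort size, so a cohort of the size reported here would in practice call for a sort-based or library implementation.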

SUBMITTER: Elfiky AA

PROVIDER: S-EPMC6324307 | biostudies-literature | 2018 Jul

REPOSITORIES: biostudies-literature


Publications

Development and Application of a Machine Learning Approach to Assess Short-term Mortality Risk Among Patients With Cancer Starting Chemotherapy.

Elfiky AA, Pany MJ, Parikh RB, Obermeyer Z

JAMA Network Open, 2018 Jul 6, issue 3


PMID: 30646043

Similar Datasets

Project description: Aims: Frailty may be found in heart failure patients, especially in the elderly, and is associated with a poor prognosis. However, assessment of frailty status is time-consuming, and the electronic frailty indices developed using health records have served as useful surrogates. We hypothesized that an electronic frailty index developed using machine learning can improve short-term mortality prediction in patients with heart failure. Methods and results: This was a retrospective observational study that included patients admitted for heart failure to nine public hospitals in Hong Kong between 2013 and 2017. Age, sex, variables in the modified frailty index, Deyo's Charlson co-morbidity index (≥2), neutrophil-to-lymphocyte ratio (NLR), and prognostic nutritional index at baseline were analysed. Gradient boosting, which is a supervised sequential ensemble learning algorithm with weak prediction submodels (typically decision trees), was applied to predict mortality. Variables were ranked in the order of importance with a total score of 100 and used to build the frailty models. Comparisons were made with decision tree and multivariable logistic regression. A total of 8893 patients (median age 81, Q1-Q3: 71-87 years old) were included, of whom 9% had 30 day mortality and 17% had 90 day mortality. Prognostic nutritional index, age, and NLR were the most important variables predicting 30 day mortality (importance score: 37.4, 32.1, and 20.5, respectively) and 90 day mortality (importance score: 35.3, 36.3, and 14.6, respectively). Gradient boosting significantly outperformed decision tree and multivariable logistic regression. The area under the curve from a five-fold cross validation was 0.90 for gradient boosting and 0.87 and 0.86 for decision tree and logistic regression in predicting 30 day mortality. For the prediction of 90 day mortality, the area under the curve was 0.92, 0.89, and 0.86 for gradient boosting, decision tree, and logistic regression, respectively. Conclusions: The electronic frailty index based on co-morbidities, inflammation, and nutrition information can readily predict mortality outcomes. Its predictive performance was significantly improved by gradient boosting techniques.

Project description: Background: Conventional risk scores for predicting short- and long-term mortality following an ST-segment elevation myocardial infarction (STEMI) are often not population specific. Objective: Apply machine learning for the prediction and identification of factors associated with short- and long-term mortality in Asian STEMI patients and compare with a conventional risk score. Methods: The National Cardiovascular Disease Database for Malaysia registry, of a multi-ethnic, heterogeneous Asian population, was used for in-hospital (6299 patients), 30-day (3130 patients), and 1-year (2939 patients) model development. 50 variables were considered. Mortality prediction was analysed using feature selection methods with machine learning algorithms and compared to the Thrombolysis in Myocardial Infarction (TIMI) score. Invasive management of varying degrees was selected as an important variable that improved mortality prediction. Results: Model performance using complete and reduced variable sets produced an area under the receiver operating characteristic curve (AUC) from 0.73 to 0.90. The best machine learning models for in-hospital, 30-day, and 1-year mortality outperformed the TIMI risk score (AUC = 0.88, 95% CI: 0.846-0.910 vs AUC = 0.81, 95% CI: 0.772-0.845; AUC = 0.90, 95% CI: 0.870-0.935 vs AUC = 0.80, 95% CI: 0.746-0.838; AUC = 0.84, 95% CI: 0.798-0.872 vs AUC = 0.76, 95% CI: 0.715-0.802; p < 0.0001 for all). The TIMI score underestimates patients' risk of mortality: 90% of non-surviving patients were classified as high risk (>50%) by the machine learning algorithm, compared to 10-30% of non-surviving patients by TIMI. Common predictors identified for short- and long-term mortality were age, heart rate, Killip class, fasting blood glucose, prior primary PCI or pharmaco-invasive therapy, and diuretics. The final algorithm was converted into an online tool with a database for continuous data archiving for algorithm validation. Conclusions: In a multi-ethnic population, patients with STEMI were better classified using the machine learning method compared to TIMI scoring. Machine learning allows for the identification of distinct factors in individual Asian populations for better mortality prediction. Ongoing continuous testing and validation will allow for better risk stratification and potentially alter management and outcomes in the future.

Project description: Study design: A retrospective machine learning (ML) classification study for prognostic modeling after anterior cervical corpectomy (ACC). Purpose: To evaluate the effectiveness of ML in predicting ACC outcomes and develop an accessible, user-friendly tool for this purpose. Overview of literature: Based on our literature review, no study has examined the capability of ML algorithms to predict major short-term ACC outcomes, such as prolonged length of hospital stay (LOS), non-home discharge, and major complications. Methods: The American College of Surgeons' National Surgical Quality Improvement Program database was used to identify patients who underwent ACC. Prolonged LOS, non-home discharges, and major complications were assessed as the outcomes of interest. ML models were developed with the TabPFN algorithm and integrated into an open-access website to predict these outcomes. Results: The models for predicting prolonged LOS, non-home discharges, and major complications demonstrated mean areas under the receiver operating characteristic curve (AUROC) of 0.802, 0.816, and 0.702, respectively. These findings highlight the discriminatory capacities of the models: fair (AUROC >0.7) for differentiating patients with major complications from those without, and good (AUROC >0.8) for distinguishing between those with and without prolonged LOS and non-home discharges. According to the SHapley Additive Explanations analysis, single- versus multiple-level surgery, age, body mass index, preoperative hematocrit, and American Society of Anesthesiologists physical status repeatedly emerged as the most important variables for each outcome. Conclusions: This study has considerably enhanced the prediction of postoperative results after ACC surgery by implementing advanced ML techniques. A major contribution is the creation of an accessible web application, highlighting the practical value of the developed models. Our findings imply that ML can serve as an invaluable supplementary tool to stratify patient risk for this procedure and can predict diverse postoperative adverse outcomes.

Project description: Importance: Chest radiography is the most common diagnostic imaging test in medicine and may also provide information about longevity and prognosis. Objective: To develop and test a convolutional neural network (CNN) (named CXR-risk) to predict long-term mortality, including noncancer death, from chest radiographs. Design, setting, and participants: In this prognostic study, CXR-risk CNN development (n = 41 856) and testing (n = 10 464) used data from the screening radiography arm of the Prostate, Lung, Colorectal, and Ovarian Cancer Screening Trial (PLCO) (n = 52 320), a community cohort of asymptomatic nonsmokers and smokers (aged 55-74 years) enrolled at 10 US sites from November 8, 1993, through July 2, 2001. External testing used data from the screening radiography arm of the National Lung Screening Trial (NLST) (n = 5493), a community cohort of heavy smokers (aged 55-74 years) enrolled at 21 US sites from August 2002 through April 2004. Data analysis was performed from January 1, 2018, to May 23, 2019. Exposure: Deep learning CXR-risk score (very low, low, moderate, high, and very high) based on CNN analysis of the enrollment radiograph. Main outcomes and measures: All-cause mortality. Prognostic value was assessed in the context of radiologists' diagnostic findings (eg, lung nodule) and standard risk factors (eg, age, sex, and diabetes) and for cause-specific mortality. Results: Among 10 464 PLCO participants (mean [SD] age, 62.4 [5.4] years; 5405 men [51.6%]; median follow-up, 12.2 years [interquartile range, 10.5-12.9 years]) and 5493 NLST test participants (mean [SD] age, 61.7 [5.0] years; 3037 men [55.3%]; median follow-up, 6.3 years [interquartile range, 6.0-6.7 years]), there was a graded association between CXR-risk score and mortality. The very high-risk group had mortality of 53.0% (PLCO) and 33.9% (NLST), which was higher compared with the very low-risk group (PLCO: unadjusted hazard ratio [HR], 18.3 [95% CI, 14.5-23.2]; NLST: unadjusted HR, 15.2 [95% CI, 9.2-25.3]; both P < .001). This association was robust to adjustment for radiologists' findings and risk factors (PLCO: adjusted HR [aHR], 4.8 [95% CI, 3.6-6.4]; NLST: aHR, 7.0 [95% CI, 4.0-12.1]; both P < .001). Comparable results were seen for lung cancer death (PLCO: aHR, 11.1 [95% CI, 4.4-27.8]; NLST: aHR, 8.4 [95% CI, 2.5-28.0]; both P ≤ .001) and for noncancer cardiovascular death (PLCO: aHR, 3.6 [95% CI, 2.1-6.2]; NLST: aHR, 47.8 [95% CI, 6.1-374.9]; both P < .001) and respiratory death (PLCO: aHR, 27.5 [95% CI, 7.7-97.8]; NLST: aHR, 31.9 [95% CI, 3.9-263.5]; both P ≤ .001). Conclusions and relevance: In this study, the deep learning CXR-risk score stratified the risk of long-term mortality based on a single chest radiograph. Individuals at high risk of mortality may benefit from prevention, screening, and lifestyle interventions.

Project description: BACKGROUND: A standard short-term inhalation study (STIS) was applied for hazard assessment of 13 metal oxide nanomaterials and micron-scale zinc oxide. METHODS: Rats were exposed to test material aerosols (ranging from 0.5 to 50 mg/m³) for five consecutive days with 14- or 21-day post-exposure observation. Bronchoalveolar lavage fluid (BALF) and histopathological sections of the entire respiratory tract were examined. Pulmonary deposition and clearance and test material translocation into extra-pulmonary organs were assessed. RESULTS: Inhaled nanomaterials were found in the lung, in alveolar macrophages, and in the draining lymph nodes. Polyacrylate-coated silica was also found in the spleen, and both zinc oxides elicited olfactory epithelium necrosis. None of the other nanomaterials was recorded in extra-pulmonary organs. Eight nanomaterials did not elicit pulmonary effects, and their no observed adverse effect concentrations (NOAECs) were at least 10 mg/m³. Five materials (coated nano-TiO2, both ZnO, both CeO2) evoked concentration-dependent transient pulmonary inflammation. Most effects were at least partially reversible during the post-exposure period. Based on the NOAECs that were derived from quantitative parameters, with BALF polymorphonuclear (PMN) neutrophil counts and total protein concentration being most sensitive, or from the severity of histopathological findings, the materials were ranked by increasing toxic potency into 3 grades: lower toxic potency: BaSO4; SiO2.acrylate (by local NOAEC); SiO2.PEG; SiO2.phosphate; SiO2.amino; nano-ZrO2; ZrO2.TODA; ZrO2.acrylate; medium toxic potency: SiO2.naked; higher toxic potency: coated nano-TiO2; nano-CeO2; Al-doped nano-CeO2; micron-scale ZnO; coated nano-ZnO (and SiO2.acrylate by systemic no observed effect concentration (NOEC)). CONCLUSION: The STIS revealed the type of effects of 13 nanomaterials and micron-scale ZnO, information on their toxic potency, and the location and reversibility of effects. Assessment of lung burden and material translocation provided preliminary biokinetic information. Based upon the study results, the STIS protocol was re-assessed and preliminary suggestions regarding the grouping of nanomaterials for safety assessment were spelled out.
