
Dataset Information


Association of Clinician Diagnostic Performance With Machine Learning-Based Decision Support Systems: A Systematic Review.


ABSTRACT:

Importance

An increasing number of machine learning (ML)-based clinical decision support systems (CDSSs) are described in the medical literature, but this research focuses almost entirely on comparing CDSSs directly with clinicians (human vs computer). Little is known about the outcomes of these systems when used as adjuncts to human decision-making (human vs human with computer).

Objectives

To conduct a systematic review to investigate the association between the interactive use of ML-based diagnostic CDSSs and clinician performance and to examine the extent of the CDSSs' human factors evaluation.

Evidence review

A search of MEDLINE, Embase, PsycINFO, and grey literature was conducted for the period between January 1, 2010, and May 31, 2019. Peer-reviewed studies published in English that compared human clinician performance with and without interactive use of an ML-based diagnostic CDSS were included. All metrics used to assess human performance were considered as outcomes. Risk of bias was assessed using the Quality Assessment of Diagnostic Accuracy Studies (QUADAS-2) tool and the Risk of Bias in Non-Randomised Studies of Interventions (ROBINS-I) tool. Narrative summaries were produced for the main outcomes. Given the heterogeneity of medical conditions, outcomes of interest, and evaluation metrics, no meta-analysis was performed.

Findings

A total of 8112 studies were initially retrieved and 5154 abstracts were screened; of these, 37 studies met the inclusion criteria. The median number of participating clinicians was 4 (interquartile range, 3-8). Of the 107 results for which statistical significance was reported, 54 (50%) showed an increase with CDSS use, 4 (4%) a decrease, and 49 (46%) no change or an unclear change. In the subgroup of studies carried out in representative clinical settings, no association between the use of ML-based diagnostic CDSSs and improved clinician performance could be observed. Interobserver agreement was the commonly reported outcome whose change was most strongly associated with CDSS use. Four studies (11%) reported on user feedback, and in all but 1 case clinicians decided to override at least some of the algorithms' recommendations. Twenty-eight studies (76%) were rated as having a high risk of bias in at least 1 of the 4 QUADAS-2 core domains, and 6 studies (16%) were considered to be at serious or critical risk of bias according to ROBINS-I.
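For readers who want to verify the proportions above, the percentages follow directly from the counts given in this paragraph. The short Python sketch below is illustrative only; the counts are taken verbatim from the Findings text.

# Counts reported in the Findings: 107 results with reported statistical significance.
total_results = 107
counts = {
    "increased with CDSS use": 54,
    "decreased with CDSS use": 4,
    "no change or unclear change": 49,
}

for label, count in counts.items():
    # Rounded to whole percentages, matching the figures quoted in the abstract.
    print(f"{label}: {count}/{total_results} = {count / total_results:.0%}")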

Conclusions and relevance

This systematic review found only sparse evidence that the use of ML-based CDSSs is associated with improved clinician diagnostic performance. Most studies had a small number of participants, were at high or unclear risk of bias, and showed little or no consideration for human factors. Caution should be exercised when estimating the current potential of ML to improve human diagnostic performance, and more comprehensive evaluation should be conducted before ML-based CDSSs are deployed in clinical settings. The results highlight the importance of considering supported human decisions as end points, rather than merely the stand-alone outputs of the CDSSs.

SUBMITTER: Vasey B 

PROVIDER: S-EPMC7953308 | biostudies-literature |

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC7936403 | biostudies-literature
| S-EPMC10262137 | biostudies-literature
| S-EPMC10375393 | biostudies-literature
| S-EPMC7448176 | biostudies-literature
| S-EPMC3174115 | biostudies-literature
| S-EPMC10765581 | biostudies-literature
| S-EPMC4373260 | biostudies-literature
| S-EPMC9577577 | biostudies-literature
| S-EPMC8105342 | biostudies-literature
| S-EPMC8759354 | biostudies-literature