ABSTRACT: Purpose
Developing medical students' clinical reasoning requires a structured longitudinal curriculum with frequent targeted assessment and feedback. Performance-based assessments, which have the strongest validity evidence, are currently not feasible for this purpose because they are time-intensive to score. This study explored the potential of using machine learning technologies to score one such assessment: the diagnostic justification essay.
Method
From May to September 2018, machine scoring algorithms were trained to score a sample of 700 diagnostic justification essays written by 414 third-year medical students from the Southern Illinois University School of Medicine classes of 2012-2017. The algorithms applied semantically based natural language processing metrics (e.g., coherence, readability) to assess essay quality on 4 criteria (differential diagnosis, recognition and use of findings, workup, and thought process); the scores for these criteria were summed to create overall scores. Three sources of validity evidence (response process, internal structure, and association with other variables) were examined.
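The study does not publish its scoring pipeline, but the description (semantic natural language processing features feeding per-criterion scores that are summed into an overall score) can be sketched. The Python sketch below is a minimal, hypothetical illustration only: the feature functions, the ridge regressors, and all names are assumptions, not the study's actual algorithm.

```python
# Hypothetical sketch of criterion-level machine scoring: NLP features per
# essay, one regressor per criterion fit to faculty ratings, criterion
# scores summed into an overall score. Not the study's actual pipeline.
import re
import numpy as np
from sklearn.linear_model import Ridge

CRITERIA = ["differential_diagnosis", "findings", "workup", "thought_process"]

def readability(text: str) -> float:
    """Flesch reading ease from rough word/sentence/syllable counts."""
    sentences = max(1, len(re.findall(r"[.!?]+", text)))
    words = text.split()
    n_words = max(1, len(words))
    syllables = sum(max(1, len(re.findall(r"[aeiouy]+", w.lower()))) for w in words)
    return 206.835 - 1.015 * (n_words / sentences) - 84.6 * (syllables / n_words)

def coherence(text: str) -> float:
    """Crude coherence proxy: lexical overlap between adjacent sentences."""
    sents = [set(s.lower().split()) for s in re.split(r"[.!?]+", text) if s.strip()]
    if len(sents) < 2:
        return 0.0
    overlaps = [len(a & b) / max(1, len(a | b)) for a, b in zip(sents, sents[1:])]
    return float(np.mean(overlaps))

def features(text: str) -> np.ndarray:
    return np.array([readability(text), coherence(text), len(text.split())])

def train_criterion_models(essays: list[str], ratings: dict[str, np.ndarray]) -> dict:
    """Fit one regressor per criterion against faculty criterion ratings."""
    X = np.vstack([features(e) for e in essays])
    return {c: Ridge().fit(X, ratings[c]) for c in CRITERIA}

def overall_score(models: dict, essay: str) -> float:
    """Sum the 4 predicted criterion scores to form the overall machine score."""
    x = features(essay).reshape(1, -1)
    return float(sum(models[c].predict(x)[0] for c in CRITERIA))
```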
Results
Machine scores correlated more strongly with faculty ratings than faculty ratings did with each other (machine: .28-.53; faculty: .13-.33) and were less case-specific. Machine scores and faculty ratings were similarly correlated with medical knowledge, clinical cognition, and prior diagnostic justification. Machine scores were more strongly associated with clinical communication than were faculty ratings (.43 vs. .31).
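For illustration, agreement figures of this kind are typically computed as correlation coefficients over paired scores. The snippet below is a minimal sketch with synthetic stand-in data; the use of Pearson correlation and all variable names are assumptions about the analysis, not details taken from the study.

```python
# Illustrative comparison of machine-faculty vs. faculty-faculty agreement,
# using synthetic stand-in scores. All data here is fabricated for the demo.
import numpy as np
from scipy.stats import pearsonr

rng = np.random.default_rng(0)
machine = rng.normal(size=100)                      # stand-in machine scores
faculty_a = 0.5 * machine + rng.normal(size=100)    # stand-in rater A
faculty_b = 0.4 * machine + rng.normal(size=100)    # stand-in rater B

r_ma, _ = pearsonr(machine, faculty_a)   # machine vs. faculty rater A
r_mb, _ = pearsonr(machine, faculty_b)   # machine vs. faculty rater B
r_ff, _ = pearsonr(faculty_a, faculty_b) # inter-rater agreement

print(f"machine-faculty: {r_ma:.2f}, {r_mb:.2f}; faculty-faculty: {r_ff:.2f}")
```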
Conclusions
Machine learning technologies may be useful for assessing medical students' long-form written clinical reasoning. Semantically based machine scoring may capture the communicative aspects of clinical reasoning better than faculty ratings do, offering the potential for automated assessment that generalizes to the workplace. These results underscore the potential of machine scoring to capture an aspect of clinical reasoning performance that is difficult to assess with traditional analytic scoring methods. Additional research should investigate the generalizability of machine scoring and examine its acceptability to trainees and educators.
SUBMITTER: Cianciolo AT
PROVIDER: S-EPMC8243833 | biostudies-literature
REPOSITORIES: biostudies-literature