Dataset Information

Developing a video-based method to compare and adjust examiner effects in fully nested OSCEs.

ABSTRACT:

Background

Although averaging across multiple examiners' judgements reduces unwanted overall score variability in objective structured clinical examinations (OSCE), designs involving several parallel circuits of the OSCE require that different examiner cohorts collectively judge performances to the same standard in order to avoid bias. Prior research suggests the potential for important examiner-cohort effects in distributed or national examinations that could compromise fairness or patient safety, but despite their importance, these effects are rarely investigated because fully nested assessment designs make them very difficult to study. We describe initial use of a new method to measure and adjust for examiner-cohort effects on students' scores.

Methods

We developed video-based examiner score comparison and adjustment (VESCA): volunteer students were filmed 'live' on 10 out of 12 OSCE stations. Following the examination, examiners additionally scored station-specific common-comparator videos, producing partial crossing between examiner cohorts. Many-facet Rasch modelling and linear mixed modelling were used to estimate and adjust for examiner-cohort effects on students' scores.

Results

After accounting for students' ability, examiner cohorts differed substantially in their stringency or leniency (maximal global score difference of 0.47 out of 7.0 [Cohen's d = 0.96]; maximal total percentage score difference of 5.7% [Cohen's d = 1.06] for the same student ability by different examiner cohorts). Corresponding adjustment of students' global and total percentage scores altered the theoretical classification of 6.0% of students for both measures (either pass to fail or fail to pass), whereas 8.6-9.5% students' scores were altered by at least 0.5 standard deviations of student ability.

Conclusions

Despite typical reliability, the examiner cohort that students encountered had a potentially important influence on their score, emphasising the need for adequate sampling and examiner training. Development and validation of VESCA may offer a means to measure and adjust for potential systematic differences in scoring patterns that could exist between locations in distributed or national OSCE examinations, thereby ensuring equivalence and fairness.

SUBMITTER: Yeates P

PROVIDER: S-EPMC6519246 | biostudies-literature | 2019 Mar

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

Developing a video-based method to compare and adjust examiner effects in fully nested OSCEs.

Yeates Peter P Cope Natalie N Hawarden Ashley A Bradshaw Hannah H McCray Gareth G Homer Matt M

Medical education 20181221 3

<h4>Background</h4>Although averaging across multiple examiners' judgements reduces unwanted overall score variability in objective structured clinical examinations (OSCE), designs involving several parallel circuits of the OSCE require that different examiner cohorts collectively judge performances to the same standard in order to avoid bias. Prior research suggests the potential for important examiner-cohort effects in distributed or national examinations that could compromise fairness or pati ...[more]

PMID: 30575092

Dataset Information

Developing a video-based method to compare and adjust examiner effects in fully nested OSCEs.

Background

Methods

Results

Conclusions

Publications

Developing a video-based method to compare and adjust examiner effects in fully nested OSCEs.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

Investigating the accuracy of adjusting for examiner differences in multi-centre Objective Structured Clinical Exams (OSCEs). A simulation study of video-based Examiner Score Comparison and Adjustment (VESCA).
| S-EPMC11654327 | biostudies-literature

Fully laparoscopic pancreaticojejunostomy, Puestow procedure (with video).
| S-EPMC8527657 | biostudies-literature

Metabolomic analysis to define and compare the effects of PAHs and oxygenated PAHs in developing zebrafish.
| S-EPMC4492807 | biostudies-literature

Brain2Pix: Fully convolutional naturalistic video frame reconstruction from brain activity
| S-EPMC9703977 | biostudies-literature

Video-based fully automatic assessment of open surgery suturing skills.
| S-EPMC8805431 | biostudies-literature

Does the Forensic Filler-Control Method Reduce Examiner Overconfidence? An Experimental Investigation Using Mock Fingerprint Examiners.
| S-EPMC12466420 | biostudies-literature

Developing a workflow for the isolation of hybridoma cells producing fully human antigen-specific antibodies using a surface IgG detection method.
| S-EPMC11452657 | biostudies-literature

Using marginal structural models to adjust for treatment drop-in when developing clinical prediction models.
| S-EPMC6282523 | biostudies-literature

A Practical Guide to Adjust Micronutrient Biomarkers for Inflammation Using the BRINDA Method.
| S-EPMC10202121 | biostudies-literature

A rigorous evaluation of a method to adjust BMI for self-report bias.
| S-EPMC8942077 | biostudies-literature