Unknown

Dataset Information

0

Retrospective comparison of approaches to evaluating inter-observer variability in CT tumour measurements in an academic health centre.


ABSTRACT: BACKGROUND:A growing number of research studies have reported inter-observer variability in sizes of tumours measured from CT scans. It remains unclear whether the conventional statistical measures correctly evaluate the CT measurement consistency for optimal treatment management and decision-making. We compared and evaluated the existing measures for evaluating inter-observer variability in CT measurement of cancer lesions. METHODS:13 board-certified radiologists repeatedly reviewed 10 CT image sets of lung lesions and hepatic metastases selected through a randomisation process. A total of 130 measurements under RECIST 1.1 (Response Evaluation Criteria in Solid Tumors) guidelines were collected for the demonstration. Intraclass correlation coefficient (ICC), Bland-Altman plotting and outlier counting methods were selected for the comparison. The each selected measure was used to evaluate three cases with observed, increased and decreased inter-observer variability. RESULTS:The ICC score yielded a weak detection when evaluating different levels of the inter-observer variability among radiologists (increased: 0.912; observed: 0.962; decreased: 0.990). The outlier counting method using Bland-Altman plotting with 2SD yielded no detection at all with its number of outliers unchanging regardless of level of inter-observer variability. Outlier counting based on domain knowledge was more sensitised to different levels of the inter-observer variability compared with the conventional measures (increased: 0.756; observed: 0.923; improved: 1.000). Visualisation of pairwise Bland-Altman bias was also sensitised to the inter-observer variability with its pattern rapidly changing in response to different levels of the inter-observer variability. CONCLUSIONS:Conventional measures may yield weak or no detection when evaluating different levels of the inter-observer variability among radiologists. We observed that the outlier counting based on domain knowledge was sensitised to the inter-observer variability in CT measurement of cancer lesions. Our study demonstrated that, under certain circumstances, the use of standard statistical correlation coefficients may be misleading and result in a sense of false security related to the consistency of measurement for optimal treatment management and decision-making.

SUBMITTER: Woo M 

PROVIDER: S-EPMC7668356 | biostudies-literature | 2020 Nov

REPOSITORIES: biostudies-literature

altmetric image

Publications

Retrospective comparison of approaches to evaluating inter-observer variability in CT tumour measurements in an academic health centre.

Woo MinJae M   Heo Moonseong M   Devane A Michael AM   Lowe Steven C SC   Gimbel Ronald W RW  

BMJ open 20201114 11


<h4>Background</h4>A growing number of research studies have reported inter-observer variability in sizes of tumours measured from CT scans. It remains unclear whether the conventional statistical measures correctly evaluate the CT measurement consistency for optimal treatment management and decision-making. We compared and evaluated the existing measures for evaluating inter-observer variability in CT measurement of cancer lesions.<h4>Methods</h4>13 board-certified radiologists repeatedly revie  ...[more]

Similar Datasets

| S-EPMC10813881 | biostudies-literature
| S-EPMC10392662 | biostudies-literature
| S-EPMC4972882 | biostudies-literature
| S-EPMC7218042 | biostudies-literature
| S-EPMC7807627 | biostudies-literature
| S-EPMC10017115 | biostudies-literature
| S-EPMC9860418 | biostudies-literature
| S-EPMC7233445 | biostudies-literature
| S-EPMC8934266 | biostudies-literature
| S-EPMC7105134 | biostudies-literature