Dataset Information

Performance of large language models (LLMs) in providing prostate cancer information.


ABSTRACT:

Purpose

The diagnosis and management of prostate cancer (PCa), the second most common cancer in men worldwide, are highly complex. Hence, patients often seek knowledge through additional resources, including AI chatbots such as ChatGPT and Google Bard. This study aimed to evaluate the performance of LLMs in providing education on PCa.

Methods

Common patient questions about PCa were collected from reliable educational websites, and the LLM responses were evaluated for accuracy, comprehensiveness, readability, and stability by two independent board-certified urologists, with a third resolving discrepancies. Accuracy was measured on a 3-point scale, comprehensiveness on a 5-point Likert scale, and readability using the Flesch Reading Ease (FRE) score and the Flesch-Kincaid (FK) Grade Level.
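For readers unfamiliar with the two readability measures, the sketch below shows how they are conventionally computed from word, sentence, and syllable counts. The formulas are the standard published ones; the helper names, the tokenization, and the vowel-group syllable counter are illustrative assumptions, not the tooling used in the study, and a crude syllable heuristic will give only approximate scores.

```python
import re


def flesch_reading_ease(words: int, sentences: int, syllables: int) -> float:
    """Standard Flesch Reading Ease formula; higher means easier to read."""
    return 206.835 - 1.015 * (words / sentences) - 84.6 * (syllables / words)


def flesch_kincaid_grade(words: int, sentences: int, syllables: int) -> float:
    """Standard Flesch-Kincaid Grade Level formula; approximates a US school grade."""
    return 0.39 * (words / sentences) + 11.8 * (syllables / words) - 15.59


def count_syllables(word: str) -> int:
    """Hypothetical heuristic: count runs of vowels, with a minimum of one."""
    return max(1, len(re.findall(r"[aeiouy]+", word.lower())))


def text_counts(text: str) -> tuple[int, int, int]:
    """Return (words, sentences, syllables) using simple regex tokenization."""
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    words = re.findall(r"[A-Za-z']+", text)
    syllables = sum(count_syllables(w) for w in words)
    return len(words), len(sentences), syllables


if __name__ == "__main__":
    sample = "Prostate cancer is common. Ask your doctor about screening."
    w, s, syl = text_counts(sample)
    print(f"FRE: {flesch_reading_ease(w, s, syl):.1f}")
    print(f"FK grade: {flesch_kincaid_grade(w, s, syl):.1f}")
```

Both scores depend only on average sentence length and average syllables per word, so shorter sentences and shorter words raise the FRE score and lower the FK grade level.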

Results

A total of 52 questions on general knowledge, diagnosis, treatment, and prevention of PCa were provided to three LLMs. Although there was no significant difference in the overall accuracy of LLMs, ChatGPT-3.5 demonstrated superiority over the other LLMs in terms of general knowledge of PCa (p = 0.018). ChatGPT-4 achieved greater overall comprehensiveness than ChatGPT-3.5 and Bard (p = 0.028). For readability, Bard generated simpler sentences with the highest FRE score (54.7, p < 0.001) and lowest FK reading level (10.2, p < 0.001).

Conclusion

ChatGPT-3.5, ChatGPT-4 and Bard generate accurate, comprehensive, and easily readable PCa material. These AI models might not replace healthcare professionals but can assist in patient education and guidance.

SUBMITTER: Alasker A 

PROVIDER: S-EPMC11342655 | biostudies-literature | 2024 Aug

REPOSITORIES: biostudies-literature

Publications

Performance of large language models (LLMs) in providing prostate cancer information.

Alasker Ahmed A, Alsalamah Seham S, Alshathri Nada N, Almansour Nura N, Alsalamah Faris F, Alghafees Mohammad M, AlKhamees Mohammad M, Alsaikhan Bader B

BMC Urology, 2024-08-23, issue 1


Similar Datasets

| S-EPMC11462564 | biostudies-literature
| S-EPMC10580627 | biostudies-literature
| S-EPMC11609263 | biostudies-literature
| S-EPMC11603009 | biostudies-literature
| S-EPMC11530873 | biostudies-literature
| S-EPMC11231310 | biostudies-literature
| S-EPMC11754967 | biostudies-literature
| S-EPMC10547367 | biostudies-literature
| S-EPMC11376538 | biostudies-literature
| S-EPMC10869356 | biostudies-literature