Ontology highlight
ABSTRACT:
SUBMITTER: Kefeli J
PROVIDER: S-EPMC10441484 | biostudies-literature | 2023 Aug
REPOSITORIES: biostudies-literature
Kefeli Jenna J Tatonetti Nicholas N
medRxiv : the preprint server for health sciences 20230808
In cancer research, pathology report text is a largely un-tapped data source. Pathology reports are routinely generated, more nuanced than structured data, and contain added insight from pathologists. However, there are no publicly-available datasets for benchmarking report-based models. Two recent advances suggest the urgent need for a benchmark dataset. First, improved optical character recognition (OCR) techniques will make it possible to access older pathology reports in an automated way, in ...[more]