Unknown

Dataset Information

0

Liquid based-cytology Pap smear dataset for automated multi-class diagnosis of pre-cancerous and cervical cancer lesions.


ABSTRACT: While a publicly available benchmark dataset provides a base for the development of new algorithms and comparison of results, hospital-based data collected from the real-world clinical setup is also very important in AI-based medical research for automated disease diagnosis, prediction or classifications as per standard protocol. Primary data must be constantly updated so that the developed algorithms achieve as much accuracy as possible in the regional context. This dataset would support research work related to image segmentation and final classification for a complete decision support system (https://doi.org/10.1016/j.tice.2020.101347) [1]. Liquid-based cytology (LBC) is one of the cervical screening tests. The repository consists of a total of 963 LBC images sub-divided into four sets representing the four classes: NILM, LSIL, HSIL, and SCC. It comprises pre-cancerous and cancerous lesions related to cervical cancer as per standards under The Bethesda System (TBS). The images were captured in 40x magnification using Leica ICC50 HD microscope collected with due consent from 460 patients visiting the O&G department of the public hospital with various gynaecological problems. The images were then viewed and categorized by experts of the pathology department.

SUBMITTER: Hussain E 

PROVIDER: S-EPMC7186519 | biostudies-literature | 2020 Jun

REPOSITORIES: biostudies-literature

altmetric image

Publications

Liquid based-cytology Pap smear dataset for automated multi-class diagnosis of pre-cancerous and cervical cancer lesions.

Hussain Elima E   Mahanta Lipi B LB   Borah Himakshi H   Das Chandana Ray CR  

Data in brief 20200422


While a publicly available benchmark dataset provides a base for the development of new algorithms and comparison of results, hospital-based data collected from the real-world clinical setup is also very important in AI-based medical research for automated disease diagnosis, prediction or classifications as per standard protocol. Primary data must be constantly updated so that the developed algorithms achieve as much accuracy as possible in the regional context. This dataset would support resear  ...[more]

Similar Datasets

| S-EPMC9689383 | biostudies-literature
| S-EPMC8192784 | biostudies-literature
| S-EPMC9406372 | biostudies-literature
| S-EPMC7581473 | biostudies-literature
| S-EPMC5773149 | biostudies-literature
| S-EPMC4178202 | biostudies-other
| S-EPMC4783961 | biostudies-literature
| S-EPMC7959623 | biostudies-literature
| S-EPMC8157706 | biostudies-literature
| PRJEB25778 | ENA