Unknown

Dataset Information

0

Scaling multi-instance support vector machine to breast cancer detection on the BreaKHis dataset.


ABSTRACT:

Motivation

Breast cancer is a type of cancer that develops in breast tissues, and, after skin cancer, it is the most commonly diagnosed cancer in women in the United States. Given that an early diagnosis is imperative to prevent breast cancer progression, many machine learning models have been developed in recent years to automate the histopathological classification of the different types of carcinomas. However, many of them are not scalable to large-scale datasets.

Results

In this study, we propose the novel Primal-Dual Multi-Instance Support Vector Machine to determine which tissue segments in an image exhibit an indication of an abnormality. We derive an efficient optimization algorithm for the proposed objective by bypassing the quadratic programming and least-squares problems, which are commonly employed to optimize Support Vector Machine models. The proposed method is computationally efficient, thereby it is scalable to large-scale datasets. We applied our method to the public BreaKHis dataset and achieved promising prediction performance and scalability for histopathological classification.

Availability and implementation

Software is publicly available at: https://1drv.ms/u/s!AiFpD21bgf2wgRLbQq08ixD0SgRD?e=OpqEmY.

Supplementary information

Supplementary data are available at Bioinformatics online.

SUBMITTER: Seo H 

PROVIDER: S-EPMC9235475 | biostudies-literature |

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC3716652 | biostudies-literature
| S-EPMC3737136 | biostudies-literature
| S-EPMC8894221 | biostudies-literature
| S-EPMC3641962 | biostudies-literature
| S-EPMC8287651 | biostudies-literature
| S-EPMC4556031 | biostudies-literature
| S-EPMC8668356 | biostudies-literature
| S-EPMC2680413 | biostudies-literature
| S-EPMC6905648 | biostudies-literature
| S-EPMC9189781 | biostudies-literature