Dataset Information

Pooling region learning of visual word for image classification using bag-of-visual-words model.

ABSTRACT: In the problem where there is not enough data to use Deep Learning, Bag-of-Visual-Words (BoVW) is still a good alternative for image classification. In BoVW model, many pooling methods are proposed to incorporate the spatial information of local feature into the image representation vector, but none of the methods devote to making each visual word have its own pooling regions. The practice of designing the same pooling regions for all the words restrains the discriminability of image representation, since the spatial distributions of the local features indexed by different visual words are not same. In this paper, we propose to make each visual word have its own pooling regions, and raise a simple yet effective method for learning pooling region. Concretely, a kind of small window named observation window is used to obtain its responses to each word over the whole image region. The pooling regions of each word are organized by a kind of tree structure, in which each node indicates a pooling region. For each word, its pooling regions are learned by constructing a tree with its labelled coordinate data. The labelled coordinate data consist of the coordinates of responses and image class labels. The effectiveness of our method is validated by observing if there is an obvious classification accuracy improvement after applying our method. Our experimental results on four small datasets (i.e., Scene-15, Caltech-101, Caltech-256 and Corel-10) show that, the classification accuracy is improved by about 1% to 2.5%. We experimentally demonstrate that the practice of making each word have its own pooling regions is beneficial to image classification task, which is the significance of our work.

SUBMITTER: Xu Y

PROVIDER: S-EPMC7274423 | biostudies-literature | 2020

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

Pooling region learning of visual word for image classification using bag-of-visual-words model.

Xu Ye Y Yu Xiaodong X Wang Tian T Xu Zezhong Z

PloS one 20200605 6

In the problem where there is not enough data to use Deep Learning, Bag-of-Visual-Words (BoVW) is still a good alternative for image classification. In BoVW model, many pooling methods are proposed to incorporate the spatial information of local feature into the image representation vector, but none of the methods devote to making each visual word have its own pooling regions. The practice of designing the same pooling regions for all the words restrains the discriminability of image representat ...[more]

PMID: 32502224

Dataset Information

Pooling region learning of visual word for image classification using bag-of-visual-words model.

Publications

Pooling region learning of visual word for image classification using bag-of-visual-words model.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

Advancing bag-of-visual-words representations for lesion classification in retinal images.
| S-EPMC4041723 | biostudies-literature

How are visual words represented? Insights from EEG-based visual word decoding, feature derivation and image reconstruction.
| S-EPMC6865374 | biostudies-literature

Classification of <i>Neisseria meningitidis</i> genomes with a bag-of-words approach and machine learning.
| S-EPMC10910294 | biostudies-literature

A method of protein model classification and retrieval using bag-of-visual-features.
| S-EPMC4165735 | biostudies-literature

Unsupervised machine learning for identifying important visual features through bag-of-words using histopathology data from chronic kidney disease.
| S-EPMC8941143 | biostudies-literature

The influence of preprocessing on text classification using a bag-of-words representation.
| S-EPMC7194364 | biostudies-literature

Affective valence of words differentially affects visual and auditory word recognition.
| S-EPMC7616442 | biostudies-literature

Visual word learning in adults with dyslexia.
| S-EPMC4018562 | biostudies-literature

Pitch enhancement facilitates word learning across visual contexts.
| S-EPMC4273622 | biostudies-literature

A visual–omics foundation model to bridge histopathology image with transcriptomics
2025-04-01 | GSE293199 | GEO