Dataset Information

Modeling Image Patches with a Generic Dictionary of Mini-Epitomes.


ABSTRACT: The goal of this paper is to question the necessity of features like SIFT in categorical visual recognition tasks. As an alternative, we develop a generative model for the raw intensity of image patches and show that it can support image classification performance on par with optimized SIFT-based techniques in a bag-of-visual-words setting. A key ingredient of the proposed model is a compact dictionary of mini-epitomes, learned in an unsupervised fashion on a large collection of images. The use of epitomes allows us to explicitly account for photometric and position variability in image appearance. We show that this flexibility considerably increases the capacity of the dictionary to accurately approximate the appearance of image patches and support recognition tasks. For image classification, we develop histogram-based image encoding methods tailored to the epitomic representation, as well as an "epitomic footprint" encoding which is easy to visualize and highlights the generative nature of our model. We discuss computational aspects in detail and develop efficient algorithms to make the model scalable to large tasks. The proposed techniques are evaluated with experiments on the challenging PASCAL VOC 2007 image classification benchmark.
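The idea of accounting for photometric and position variability when matching a patch against a dictionary of mini-epitomes can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes each mini-epitome is a small 2-D array slightly larger than the patch, and it substitutes mean removal plus unit-norm scaling with an exhaustive window search for the paper's generative model. The names `normalize` and `best_epitome_match` and the random test data are hypothetical.

```python
import numpy as np


def normalize(p, eps=1e-8):
    """Remove the mean and scale to unit norm (assumed photometric normalization)."""
    q = p - p.mean()
    return q / (np.linalg.norm(q) + eps)


def best_epitome_match(patch, epitomes):
    """Search every window of every epitome; return (k, i, j, score) of the best one.

    k is the epitome index, (i, j) the top-left corner of the matched window,
    and score the normalized cross-correlation between patch and window.
    """
    h, w = patch.shape
    x = normalize(patch)
    best = (-1, -1, -1, -np.inf)
    for k, epi in enumerate(epitomes):
        H, W = epi.shape
        for i in range(H - h + 1):          # position variability: vertical shift
            for j in range(W - w + 1):      # position variability: horizontal shift
                window = normalize(epi[i:i + h, j:j + w])
                score = float(np.sum(x * window))
                if score > best[3]:
                    best = (k, i, j, score)
    return best


# Hypothetical data: four random 12x12 "epitomes" and an 8x8 patch cut from
# one of them, then altered by a gain and an offset.
rng = np.random.default_rng(0)
epitomes = rng.standard_normal((4, 12, 12))
patch = 3.0 * epitomes[2, 1:9, 3:11] + 5.0
match = best_epitome_match(patch, epitomes)
```

In this toy setting the search recovers the epitome and the window the patch was cut from despite the gain and offset change, which is the kind of invariance the abstract attributes to the epitomic representation. The brute-force loops are for clarity only and do not reflect the efficient algorithms the paper mentions.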

SUBMITTER: Papandreou G 

PROVIDER: S-EPMC4550088 | biostudies-literature | 2014 Jun

REPOSITORIES: biostudies-literature

Publications

Modeling Image Patches with a Generic Dictionary of Mini-Epitomes.

Papandreou G, Chen LC, Yuille AL

Proceedings. IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2014-06-01


Similar Datasets

| S-EPMC7039536 | biostudies-literature
| S-EPMC6899488 | biostudies-literature
| S-EPMC3496339 | biostudies-literature
| S-EPMC4550222 | biostudies-literature
| S-EPMC5536359 | biostudies-literature
| S-EPMC5732496 | biostudies-literature
| S-EPMC8875652 | biostudies-literature
| S-EPMC7405596 | biostudies-literature
| S-EPMC7319327 | biostudies-literature
| S-EPMC9202882 | biostudies-literature