Dataset Information

Feedforward object-vision models only tolerate small image variations compared to human.

ABSTRACT: Invariant object recognition is a remarkable ability of primates' visual system that its underlying mechanism has constantly been under intense investigations. Computational modeling is a valuable tool toward understanding the processes involved in invariant object recognition. Although recent computational models have shown outstanding performances on challenging image databases, they fail to perform well in image categorization under more complex image variations. Studies have shown that making sparse representation of objects by extracting more informative visual features through a feedforward sweep can lead to higher recognition performances. Here, however, we show that when the complexity of image variations is high, even this approach results in poor performance compared to humans. To assess the performance of models and humans in invariant object recognition tasks, we built a parametrically controlled image database consisting of several object categories varied in different dimensions and levels, rendered from 3D planes. Comparing the performance of several object recognition models with human observers shows that only in low-level image variations the models perform similar to humans in categorization tasks. Furthermore, the results of our behavioral experiments demonstrate that, even under difficult experimental conditions (i.e., briefly presented masked stimuli with complex image variations), human observers performed outstandingly well, suggesting that the models are still far from resembling humans in invariant object recognition. Taken together, we suggest that learning sparse informative visual features, although desirable, is not a complete solution for future progresses in object-vision modeling. We show that this approach is not of significant help in solving the computational crux of object recognition (i.e., invariant object recognition) when the identity-preserving image variations become more complex.

SUBMITTER: Ghodrati M

PROVIDER: S-EPMC4103258 | biostudies-literature | 2014

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

Feedforward object-vision models only tolerate small image variations compared to human.

Ghodrati Masoud M Farzmahdi Amirhossein A Rajaei Karim K Ebrahimpour Reza R Khaligh-Razavi Seyed-Mahdi SM

Frontiers in computational neuroscience 20140718

Invariant object recognition is a remarkable ability of primates' visual system that its underlying mechanism has constantly been under intense investigations. Computational modeling is a valuable tool toward understanding the processes involved in invariant object recognition. Although recent computational models have shown outstanding performances on challenging image databases, they fail to perform well in image categorization under more complex image variations. Studies have shown that makin ...[more]

PMID: 25100986

Dataset Information

Feedforward object-vision models only tolerate small image variations compared to human.

Publications

Feedforward object-vision models only tolerate small image variations compared to human.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

An ecologically motivated image dataset for deep learning yields better models of human vision.
| S-EPMC7923360 | biostudies-literature

Statistical Inference Models for Image Datasets with Systematic Variations.
| S-EPMC4792194 | biostudies-literature

Deep Networks Can Resemble Human Feed-forward Vision in Invariant Object Recognition.
| S-EPMC5013454 | biostudies-literature

Integration of Industrially-Oriented Human-Robot Speech Communication and Vision-Based Object Recognition.
| S-EPMC7767307 | biostudies-literature

Visual Object Tracking in First Person Vision.
| S-EPMC9816211 | biostudies-literature

Applying artificial vision models to human scene understanding.
| S-EPMC4316773 | biostudies-literature

Object representations in the human brain reflect the co-occurrence statistics of vision and language.
| S-EPMC8253839 | biostudies-literature

VANO: a volume-object image annotation system.
| S-EPMC2647838 | biostudies-literature

SOLID-Similar object and lure image database.
| S-EPMC7005083 | biostudies-literature

CognitionMaster: an object-based image analysis framework.
| S-EPMC3626931 | biostudies-literature