Unknown

Dataset Information

0

Predicting human complexity perception of real-world scenes.


ABSTRACT: Perceptual load is a well-established determinant of attentional engagement in a task. So far, perceptual load has typically been manipulated by increasing either the number of task-relevant items or the perceptual processing demand (e.g. conjunction versus feature tasks). The tasks used often involved rather simple visual displays (e.g. letters or single objects). How can perceptual load be operationalized for richer, real-world images? A promising proxy is the visual complexity of an image. However, current predictive models for visual complexity have limited applicability to diverse real-world images. Here we modelled visual complexity using a deep convolutional neural network (CNN) trained to learn perceived ratings of visual complexity. We presented 53 observers with 4000 images from the PASCAL VOC dataset, obtaining 75 020 2-alternative forced choice paired comparisons across observers. Image visual complexity scores were obtained using the TrueSkill algorithm. A CNN with weights pre-trained on an object recognition task predicted complexity ratings with r = 0.83. By contrast, feature-based models used in the literature, working on image statistics such as entropy, edge density and JPEG compression ratio, only achieved r = 0.70. Thus, our model offers a promising method to quantify the perceptual load of real-world scenes through visual complexity.

SUBMITTER: Nagle F 

PROVIDER: S-EPMC7277246 | biostudies-literature | 2020 May

REPOSITORIES: biostudies-literature

altmetric image

Publications

Predicting human complexity perception of real-world scenes.

Nagle Fintan F   Lavie Nilli N  

Royal Society open science 20200513 5


Perceptual load is a well-established determinant of attentional engagement in a task. So far, perceptual load has typically been manipulated by increasing either the number of task-relevant items or the perceptual processing demand (e.g. conjunction versus feature tasks). The tasks used often involved rather simple visual displays (e.g. letters or single objects). How can perceptual load be operationalized for richer, real-world images? A promising proxy is the visual complexity of an image. Ho  ...[more]

Similar Datasets

| S-EPMC4919018 | biostudies-literature
| S-EPMC7900870 | biostudies-literature
| S-EPMC6647383 | biostudies-literature
| S-EPMC5514174 | biostudies-literature
| S-EPMC10560015 | biostudies-literature
| S-EPMC10407052 | biostudies-literature
| S-EPMC9579010 | biostudies-literature
| S-EPMC6222059 | biostudies-literature
| S-EPMC10632141 | biostudies-literature
| S-EPMC6315028 | biostudies-literature