Dataset Information

Novelty is not surprise: Human exploratory and adaptive behavior in sequential decision-making.

ABSTRACT: Classic reinforcement learning (RL) theories cannot explain human behavior in the absence of external reward or when the environment changes. Here, we employ a deep sequential decision-making paradigm with sparse reward and abrupt environmental changes. To explain the behavior of human participants in these environments, we show that RL theories need to include surprise and novelty, each with a distinct role. While novelty drives exploration before the first encounter of a reward, surprise increases the rate of learning of a world-model as well as of model-free action-values. Even though the world-model is available for model-based RL, we find that human decisions are dominated by model-free action choices. The world-model is only marginally used for planning, but it is important to detect surprising events. Our theory predicts human action choices with high probability and allows us to dissociate surprise, novelty, and reward in EEG signals.

SUBMITTER: Xu HA

PROVIDER: S-EPMC8205159 | biostudies-literature | 2021 Jun

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

Novelty is not surprise: Human exploratory and adaptive behavior in sequential decision-making.

Xu He A HA Modirshanechi Alireza A Lehmann Marco P MP Gerstner Wulfram W Herzog Michael H MH

PLoS computational biology 20210603 6

Classic reinforcement learning (RL) theories cannot explain human behavior in the absence of external reward or when the environment changes. Here, we employ a deep sequential decision-making paradigm with sparse reward and abrupt environmental changes. To explain the behavior of human participants in these environments, we show that RL theories need to include surprise and novelty, each with a distinct role. While novelty drives exploration before the first encounter of a reward, surprise incre ...[more]

PMID: 34081705

Dataset Information

Novelty is not surprise: Human exploratory and adaptive behavior in sequential decision-making.

Publications

Novelty is not surprise: Human exploratory and adaptive behavior in sequential decision-making.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

Superhuman artificial intelligence can improve human decision-making by increasing novelty.
| S-EPMC10041097 | biostudies-literature

Social Influences in Sequential Decision Making.
| S-EPMC4718651 | biostudies-literature

Adaptive Gaze Behavior and Decision Making of Penalty Corner Strikers in Field Hockey.
| S-EPMC8366230 | biostudies-literature

Attention to novelty interferes with toddlers' emerging memory decision-making.
| S-EPMC10770300 | biostudies-literature

What to Choose Next? A Paradigm for Testing Human Sequential Decision Making.
| S-EPMC5339299 | biostudies-literature

Subjective optimality in finite sequential decision-making.
| S-EPMC8675647 | biostudies-literature

A multi-stage anticipated surprise model with dynamic expectation for economic decision-making.
| S-EPMC10770108 | biostudies-literature

Heuristic and optimal policy computations in the human brain during sequential decision-making.
| S-EPMC5780427 | biostudies-literature

Brain and behavior in decision-making.
| S-EPMC4081035 | biostudies-literature

Motivational system modulates brain responses during exploratory decision-making.
| S-EPMC8339076 | biostudies-literature