Dataset Information

Dissociation between asymmetric value updating and perseverance in human reinforcement learning.

ABSTRACT: The learning rate is a key parameter in reinforcement learning that determines the extent to which novel information (outcome) is incorporated in guiding subsequent actions. Numerous studies have reported that the magnitude of the learning rate in human reinforcement learning is biased depending on the sign of the reward prediction error. However, this asymmetry can be observed as a statistical bias if the fitted model ignores the choice autocorrelation (perseverance), which is independent of the outcomes. Therefore, to investigate the genuine process underlying human choice behavior using empirical data, one should dissociate asymmetry in learning and perseverance from choice behavior. The present study addresses this issue by using a Hybrid model incorporating asymmetric learning rates and perseverance. First, by conducting simulations, we demonstrate that the Hybrid model can identify the true underlying process. Second, using the Hybrid model, we show that empirical data collected from a web-based experiment are governed by perseverance rather than asymmetric learning. Finally, we apply the Hybrid model to two open datasets in which asymmetric learning was reported. As a result, the asymmetric learning rate was validated in one dataset but not another.

SUBMITTER: Sugawara M

PROVIDER: S-EPMC7878894 | biostudies-literature | 2021 Feb

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

Dissociation between asymmetric value updating and perseverance in human reinforcement learning.

Sugawara Michiyo M Katahira Kentaro K

Scientific reports 20210211 1

The learning rate is a key parameter in reinforcement learning that determines the extent to which novel information (outcome) is incorporated in guiding subsequent actions. Numerous studies have reported that the magnitude of the learning rate in human reinforcement learning is biased depending on the sign of the reward prediction error. However, this asymmetry can be observed as a statistical bias if the fitted model ignores the choice autocorrelation (perseverance), which is independent of th ...[more]

PMID: 33574424

Dataset Information

Dissociation between asymmetric value updating and perseverance in human reinforcement learning.

Publications

Dissociation between asymmetric value updating and perseverance in human reinforcement learning.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

Asymmetric reinforcement learning facilitates human inference of transitive relations.
| S-EPMC9038534 | biostudies-literature

Reinforcement learning for solution updating in Artificial Bee Colony.
| S-EPMC6049945 | biostudies-literature

The functional form of value normalization in human reinforcement learning.
| S-EPMC10393293 | biostudies-literature

Surprise Acts as a Reducer of Outcome Value in Human Reinforcement Learning.
| S-EPMC7506125 | biostudies-literature

Asymmetric and adaptive reward coding via normalized reinforcement learning.
| S-EPMC9345478 | biostudies-literature

Genetic triple dissociation reveals multiple roles for dopamine in reinforcement learning.
| S-EPMC2042203 | biostudies-literature

Human thalamic low-frequency oscillations correlate with expected value and outcomes during reinforcement learning.
| S-EPMC10582006 | biostudies-literature

A value-based deep reinforcement learning model with human expertise in optimal treatment of sepsis.
| S-EPMC9894526 | biostudies-literature

Why do valence asymmetries emerge in value learning? A reinforcement learning account.
| S-EPMC10390629 | biostudies-literature

A distributional code for value in dopamine-based reinforcement learning.
| S-EPMC7476215 | biostudies-literature