Dataset Information

Human subjects exploit a cognitive map for credit assignment.

ABSTRACT: An influential reinforcement learning framework proposes that behavior is jointly governed by model-free (MF) and model-based (MB) controllers. The former learns the values of actions directly from past encounters, and the latter exploits a cognitive map of the task to calculate these prospectively. Considerable attention has been paid to how these systems interact during choice, but how and whether knowledge of a cognitive map contributes to the way MF and MB controllers assign credit (i.e., to how they revaluate actions and states following the receipt of an outcome) remains underexplored. Here, we examine such sophisticated credit assignment using a dual-outcome bandit task. We provide evidence that knowledge of a cognitive map influences credit assignment in both MF and MB systems, mediating subtly different aspects of apparent relevance. Specifically, we show MF credit assignment is enhanced for those rewards that are related to a choice, and this contrasted with choice-unrelated rewards that reinforced subsequent choices negatively. This modulation is only possible based on knowledge of task structure. On the other hand, MB credit assignment was boosted for outcomes that impacted on differences in values between offered bandits. We consider mechanistic accounts and the normative status of these findings. We suggest the findings extend the scope and sophistication of cognitive map-based credit assignment during reinforcement learning, with implications for understanding behavioral control.

SUBMITTER: Moran R

PROVIDER: S-EPMC7848688 | biostudies-literature | 2021 Jan

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

Human subjects exploit a cognitive map for credit assignment.

Moran Rani R Dayan Peter P Dolan Raymond J RJ

Proceedings of the National Academy of Sciences of the United States of America 20210101 4

An influential reinforcement learning framework proposes that behavior is jointly governed by model-free (MF) and model-based (MB) controllers. The former learns the values of actions directly from past encounters, and the latter exploits a cognitive map of the task to calculate these prospectively. Considerable attention has been paid to how these systems interact during choice, but how and whether knowledge of a cognitive map contributes to the way MF and MB controllers assign credit (i.e., to ...[more]

PMID: 33479182

Dataset Information

Human subjects exploit a cognitive map for credit assignment.

Publications

Human subjects exploit a cognitive map for credit assignment.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

Strategy inference during learning via cognitive activity-based credit assignment models.
| S-EPMC10256696 | biostudies-literature

Neural reactivations during sleep determine network credit assignment.
| S-EPMC5808917 | biostudies-literature

Spatio-temporal credit assignment in neuronal population learning.
| S-EPMC3127803 | biostudies-literature

Efficiency and prioritization of inference-based credit assignment.
| S-EPMC8279739 | biostudies-literature

Retrospective model-based inference guides model-free credit assignment.
| S-EPMC6375980 | biostudies-literature

Neural correlates of temporal credit assignment in the parietal lobe.
| S-EPMC3921206 | biostudies-literature

Neural mechanisms of credit assignment for delayed outcomes during contingent learning.
| S-EPMC11326259 | biostudies-literature

Dual credit assignment processes underlie dopamine signals in a complex spatial environment.
| S-EPMC10054934 | biostudies-literature

Cell-type-specific neuromodulation guides synaptic credit assignment in a spiking neural network.
| S-EPMC8713766 | biostudies-literature

Agency rescues competition for credit assignment among predictive cues from adverse learning conditions.
| S-EPMC8355250 | biostudies-literature