Unknown

Dataset Information

0

A distributional code for value in dopamine-based reinforcement learning.


ABSTRACT: Since its introduction, the reward prediction error theory of dopamine has explained a wealth of empirical phenomena, providing a unifying framework for understanding the representation of reward and value in the brain1-3. According to the now canonical theory, reward predictions are represented as a single scalar quantity, which supports learning about the expectation, or mean, of stochastic outcomes. Here we propose an account of dopamine-based reinforcement learning inspired by recent artificial intelligence research on distributional reinforcement learning4-6. We hypothesized that the brain represents possible future rewards not as a single mean, but instead as a probability distribution, effectively representing multiple future outcomes simultaneously and in parallel. This idea implies a set of empirical predictions, which we tested using single-unit recordings from mouse ventral tegmental area. Our findings provide strong evidence for a neural realization of distributional reinforcement learning.

SUBMITTER: Dabney W 

PROVIDER: S-EPMC7476215 | biostudies-literature | 2020 Jan

REPOSITORIES: biostudies-literature

altmetric image

Publications

A distributional code for value in dopamine-based reinforcement learning.

Dabney Will W   Kurth-Nelson Zeb Z   Uchida Naoshige N   Starkweather Clara Kwon CK   Hassabis Demis D   Munos Rémi R   Botvinick Matthew M  

Nature 20200115 7792


Since its introduction, the reward prediction error theory of dopamine has explained a wealth of empirical phenomena, providing a unifying framework for understanding the representation of reward and value in the brain<sup>1-3</sup>. According to the now canonical theory, reward predictions are represented as a single scalar quantity, which supports learning about the expectation, or mean, of stochastic outcomes. Here we propose an account of dopamine-based reinforcement learning inspired by rec  ...[more]

Similar Datasets

| S-EPMC4728725 | biostudies-literature
| S-EPMC5993631 | biostudies-literature
| S-EPMC4138746 | biostudies-literature
| S-EPMC6291167 | biostudies-literature
| S-EPMC9402573 | biostudies-literature
| S-EPMC10477234 | biostudies-literature
| S-EPMC10390629 | biostudies-literature
| S-EPMC9630918 | biostudies-literature
| S-EPMC5531832 | biostudies-literature
| S-EPMC2042203 | biostudies-literature