ABSTRACT:
SUBMITTER: Green CS
PROVIDER: S-EPMC2941269 | biostudies-literature | 2010 Sep
REPOSITORIES: biostudies-literature
Green CS, Benson C, Kersten D, Schrater P
Proceedings of the National Academy of Sciences of the United States of America, 2010-08-30, issue 37
How to compute initially unknown reward values makes up one of the key problems in reinforcement learning theory, with two basic approaches being used. Model-free algorithms rely on the accumulation of substantial amounts of experience to compute the value of actions, whereas in model-based learning, the agent seeks to learn the generative process for outcomes from which the value of actions can be predicted. Here we show that (i) "probability matching," a consistent example of suboptimal choice ...
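As a rough illustration of why probability matching counts as suboptimal, the sketch below compares it with always choosing the more likely option. It assumes a simple two-option task that is not taken from the paper: on each trial exactly one option is correct, option A with a fixed probability p, independently across trials. The function names are hypothetical.

```python
def matching_accuracy(p: float) -> float:
    """Expected hit rate under probability matching.

    Assumption: the chooser picks option A with probability p, and
    option A is correct with probability p, independently per trial.
    A hit occurs when choice and outcome agree.
    """
    return p * p + (1.0 - p) * (1.0 - p)


def maximizing_accuracy(p: float) -> float:
    """Expected hit rate when always choosing the more likely option."""
    return max(p, 1.0 - p)


if __name__ == "__main__":
    # Compare the two strategies over a few illustrative values of p.
    for p in (0.5, 0.6, 0.75, 0.9):
        m = matching_accuracy(p)
        x = maximizing_accuracy(p)
        print(f"p={p:.2f}  matching={m:.3f}  maximizing={x:.3f}")
```

Under these assumptions, matching never beats maximizing, and the two coincide only when p is 0.5, 0, or 1. For p = 0.75, for example, matching yields an expected hit rate of 0.625 against 0.75 for maximizing.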