Dataset Information

Reinforcement Learning Explains Conditional Cooperation and Its Moody Cousin.

ABSTRACT: Direct reciprocity, or repeated interaction, is a main mechanism to sustain cooperation under social dilemmas involving two individuals. For larger groups and networks, which are probably more relevant to understanding and engineering our society, experiments employing repeated multiplayer social dilemma games have suggested that humans often show conditional cooperation behavior and its moody variant. Mechanisms underlying these behaviors largely remain unclear. Here we provide a proximate account for this behavior by showing that individuals adopting a type of reinforcement learning, called aspiration learning, phenomenologically behave as conditional cooperator. By definition, individuals are satisfied if and only if the obtained payoff is larger than a fixed aspiration level. They reinforce actions that have resulted in satisfactory outcomes and anti-reinforce those yielding unsatisfactory outcomes. The results obtained in the present study are general in that they explain extant experimental results obtained for both so-called moody and non-moody conditional cooperation, prisoner's dilemma and public goods games, and well-mixed groups and networks. Different from the previous theory, individuals are assumed to have no access to information about what other individuals are doing such that they cannot explicitly use conditional cooperation rules. In this sense, myopic aspiration learning in which the unconditional propensity of cooperation is modulated in every discrete time step explains conditional behavior of humans. Aspiration learners showing (moody) conditional cooperation obeyed a noisy GRIM-like strategy. This is different from the Pavlov, a reinforcement learning strategy promoting mutual cooperation in two-player situations.

SUBMITTER: Ezaki T

PROVIDER: S-EPMC4954710 | biostudies-literature | 2016 Jul

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

Reinforcement Learning Explains Conditional Cooperation and Its Moody Cousin.

Ezaki Takahiro T Horita Yutaka Y Takezawa Masanori M Masuda Naoki N

PLoS computational biology 20160720 7

Direct reciprocity, or repeated interaction, is a main mechanism to sustain cooperation under social dilemmas involving two individuals. For larger groups and networks, which are probably more relevant to understanding and engineering our society, experiments employing repeated multiplayer social dilemma games have suggested that humans often show conditional cooperation behavior and its moody variant. Mechanisms underlying these behaviors largely remain unclear. Here we provide a proximate acco ...[more]

PMID: 27438888

Dataset Information

Reinforcement Learning Explains Conditional Cooperation and Its Moody Cousin.

Publications

Reinforcement Learning Explains Conditional Cooperation and Its Moody Cousin.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

Reinforcement learning accounts for moody conditional cooperation behavior: experimental results.
| S-EPMC5223288 | biostudies-literature

Intrinsic fluctuations of reinforcement learning promote cooperation.
| S-EPMC9873645 | biostudies-literature

Multiagent cooperation and competition with deep reinforcement learning.
| S-EPMC5381785 | biostudies-literature

Scaffolding cooperation in human groups with deep reinforcement learning.
| S-EPMC10593606 | biostudies-literature

A hierarchical reinforcement learning model explains individual differences in attentional set shifting.
| S-EPMC11525250 | biostudies-literature

Payoff-based learning explains the decline in cooperation in public goods games.
| S-EPMC4309006 | biostudies-literature

Predicting chemical structure using reinforcement learning with a stack-augmented conditional variational autoencoder.
| S-EPMC9733204 | biostudies-literature

Conditional cooperation with longer memory.
| S-EPMC11648855 | biostudies-literature

Conditional cooperation in group contests.
| S-EPMC7757887 | biostudies-literature

Neural basis of conditional cooperation.
| S-EPMC3110432 | biostudies-literature