Ontology highlight
ABSTRACT:
SUBMITTER: Park YJ
PROVIDER: S-EPMC6739057 | biostudies-literature | 2019
REPOSITORIES: biostudies-literature
Park Young Joon YJ Cho Yoon Sang YS Kim Seoung Bum SB
PloS one 20190911 9
We propose a method for learning multi-agent policies to compete against multiple opponents. The method consists of recurrent neural network-based actor-critic networks and deterministic policy gradients that promote cooperation between agents by communication. The learning process does not require access to opponents' parameters or observations because the agents are trained separately from the opponents. The actor networks enable the agents to communicate using forward and backward paths while ...[more]