Dataset Information

You Were Always on My Mind: Introducing Chef's Hat and COPPER for Personalized Reinforcement Learning.

ABSTRACT: Reinforcement learning simulation environments pose an important experimental test bed and facilitate data collection for developing AI-based robot applications. Most of them, however, focus on single-agent tasks, which limits their application to the development of social agents. This study proposes the Chef's Hat simulation environment, which implements a multi-agent competitive card game that is a complete reproduction of the homonymous board game, designed to provoke competitive strategies in humans and emotional responses. The game was shown to be ideal for developing personalized reinforcement learning, in an online learning closed-loop scenario, as its state representation is extremely dynamic and directly related to each of the opponent's actions. To adapt current reinforcement learning agents to this scenario, we also developed the COmPetitive Prioritized Experience Replay (COPPER) algorithm. With the help of COPPER and the Chef's Hat simulation environment, we evaluated the following: (1) 12 experimental learning agents, trained via four different regimens (self-play, play against a naive baseline, PER, or COPPER) with three algorithms based on different state-of-the-art learning paradigms (PPO, DQN, and ACER), and two "dummy" baseline agents that take random actions, (2) the performance difference between COPPER and PER agents trained using the PPO algorithm and playing against different agents (PPO, DQN, and ACER) or all DQN agents, and (3) human performance when playing against two different collections of agents. Our experiments demonstrate that COPPER helps agents learn to adapt to different types of opponents, improving the performance when compared to off-line learning models. An additional contribution of the study is the formalization of the Chef's Hat competitive game and the implementation of the Chef's Hat Player Club, a collection of trained and assessed agents as an enabler for embedding human competitive strategies in social continual and competitive reinforcement learning.

SUBMITTER: Barros P

PROVIDER: S-EPMC8323774 | biostudies-literature | 2021

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

You Were Always on My Mind: Introducing Chef's Hat and COPPER for Personalized Reinforcement Learning.

Barros Pablo P Bloem Anne C AC Hootsmans Inge M IM Opheij Lena M LM Toebosch Romain H A RHA Barakova Emilia E Sciutti Alessandra A

Frontiers in robotics and AI 20210716

Reinforcement learning simulation environments pose an important experimental test bed and facilitate data collection for developing AI-based robot applications. Most of them, however, focus on single-agent tasks, which limits their application to the development of social agents. This study proposes the Chef's Hat simulation environment, which implements a multi-agent competitive card game that is a complete reproduction of the homonymous board game, designed to provoke competitive strategies i ...[more]

PMID: 34336935

Similar Datasets

Project description:Recommender systems have become a core component of various online platforms, helping users get relevant information from the abundant digital data. Traditional RSs often generate static recommendations, which may not adapt well to changing user preferences. To address this problem, we propose a novel reinforcement learning (RL) recommendation algorithm that can give personalized recommendations by adapting to changing user preferences. However, a significant drawback of RL-based recommendation systems is that they are computationally expensive. Moreover, these systems often fail to extract local patterns residing within dataset which may result in generation of low quality recommendations. The proposed work utilizes biclustering technique to create an efficient environment for RL agents, thus, reducing computation cost and enabling the generation of dynamic recommendations. Additionally, biclustering is used to find locally associated patterns in the dataset, which further improves the efficiency of the RL agent's learning process. The proposed work experiments eight state-of-the-art biclustering algorithms to identify the appropriate biclustering algorithm for the given recommendation task. This innovative integration of biclustering and reinforcement learning addresses key gaps in existing literature. Moreover, we introduced a novel strategy to predict item ratings within the RL framework. The validity of the proposed algorithm is evaluated on three datasets of movies domain, namely, ML100K, ML-latest-small and FilmTrust. These diverse datasets were chosen to ensure reliable examination across various scenarios. As per the dynamic nature of RL, some specific evaluation metrics like personalization, diversity, intra-list similarity and novelty are used to measure the diversity of recommendations. This investigation is motivated by the need for recommender systems that can dynamically adjust to changes in customer preferences. Results show that our proposed algorithm showed promising results when compared with existing state-of-the-art recommendation techniques.

Project description:ObjectivesTen per cent of patients diagnosed with pancreatic cancer undergo pancreaticoduodenectomy. There is limited previous research focusing on psychological well-being; unmet support needs impact negatively on quality of life. This paper reports the psychological impact of a pancreatic cancer diagnosis and subsequent pancreaticoduodenectomy, exploring how patients' lives alter following surgery and how they seek support.DesignInductive qualitative study involving in-depth semistructured interviews with 20 participants who had undergone pancreaticoduodenectomy for pancreatic or distal biliary duct cancer. Interviews were audiorecorded, transcribed and anonymised, and thematic analysis used principles of constant comparison.SettingSingle National Health Service Trust in Northwest England.ParticipantsPatients were eligible for inclusion if they had had pancreaticoduodenectomy for head of pancreas cancer, periampullary cancer or distal cholangiocarcinoma between 6 months and 6 years previously, and had completed adjuvant chemotherapy.ResultsAnalysis identified the following main themes: diagnosis and decision making around surgery; recovery from surgery and chemotherapy; burden of monitoring and ongoing symptoms; adjusting to 'a new normal'; understanding around prognosis; support-seeking. Participants seized the chance to have surgery, often without seeming to absorb the risks or their prognosis. They perceived that they were unable to control their life trajectory and, although they valued close monitoring, experienced anxiety around their appointments. Participants expressed uncertainty about whether they would be able to return to their former activities. There were tensions in their comments about support-seeking, but most felt that emotional support should be offered proactively.ConclusionsPatients should be made aware of potential psychological sequelae, and that treatment completion may trigger the need for more support. Clinical nurse specialists (CNSs) were identified as key members of the team in proactively offering support; further training for CNSs should be encouraged. Understanding patients' experience of living with cancer and the impact of treatment is crucial in enabling the development of improved support interventions.

Dataset Information

You Were Always on My Mind: Introducing Chef's Hat and COPPER for Personalized Reinforcement Learning.

Publications

You Were Always on My Mind: Introducing Chef's Hat and COPPER for Personalized Reinforcement Learning.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets