Dataset Information

Improving Robot Motor Learning with Negatively Valenced Reinforcement Signals.

ABSTRACT: Both nociception and punishment signals have been used in robotics. However, the potential for using these negatively valenced types of reinforcement learning signals for robot learning has not been exploited in detail yet. Nociceptive signals are primarily used as triggers of preprogrammed action sequences. Punishment signals are typically disembodied, i.e., with no or little relation to the agent-intrinsic limitations, and they are often used to impose behavioral constraints. Here, we provide an alternative approach for nociceptive signals as drivers of learning rather than simple triggers of preprogrammed behavior. Explicitly, we use nociception to expand the state space while we use punishment as a negative reinforcement learning signal. We compare the performance-in terms of task error, the amount of perceived nociception, and length of learned action sequences-of different neural networks imbued with punishment-based reinforcement signals for inverse kinematic learning. We contrast the performance of a version of the neural network that receives nociceptive inputs to that without such a process. Furthermore, we provide evidence that nociception can improve learning-making the algorithm more robust against network initializations-as well as behavioral performance by reducing the task error, perceived nociception, and length of learned action sequences. Moreover, we provide evidence that punishment, at least as typically used within reinforcement learning applications, may be detrimental in all relevant metrics.

SUBMITTER: Navarro-Guerrero N

PROVIDER: S-EPMC5376586 | biostudies-other | 2017

REPOSITORIES: biostudies-other

ACCESS DATA

Publications

Improving Robot Motor Learning with Negatively Valenced Reinforcement Signals.

Navarro-Guerrero Nicolás N Lowe Robert J RJ Wermter Stefan S

Frontiers in neurorobotics 20170403

Both nociception and punishment signals have been used in robotics. However, the potential for using these negatively valenced types of reinforcement learning signals for robot learning has not been exploited in detail yet. Nociceptive signals are primarily used as triggers of preprogrammed action sequences. Punishment signals are typically disembodied, i.e., with no or little relation to the agent-intrinsic limitations, and they are often used to impose behavioral constraints. Here, we provide ...[more]

PMID: 28420976

Dataset Information

Improving Robot Motor Learning with Negatively Valenced Reinforcement Signals.

Publications

Improving Robot Motor Learning with Negatively Valenced Reinforcement Signals.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

A soft artificial muscle driven robot with reinforcement learning.
| S-EPMC6162322 | biostudies-literature

Controlling the Solo12 quadruped robot with deep reinforcement learning.
| S-EPMC10366154 | biostudies-literature

A Hybrid PAC Reinforcement Learning Algorithm for Human-Robot Interaction.
| S-EPMC8982074 | biostudies-literature

Reinforcement Learning With Vision-Proprioception Model for Robot Planar Pushing.
| S-EPMC8926160 | biostudies-literature

Characterization of continuum robot arms under reinforcement learning and derived improvements.
| S-EPMC9475256 | biostudies-literature

Improving candidate Biosynthetic Gene Clusters in fungi through reinforcement learning.
| S-EPMC9364373 | biostudies-literature

Stable Jumping Control Based on Deep Reinforcement Learning for a Locust-Inspired Robot.
| S-EPMC11430585 | biostudies-literature

Age-dependent predictors of effective reinforcement motor learning across childhood.
| S-EPMC11257637 | biostudies-literature

Intrinsic interactive reinforcement learning - Using error-related potentials for real world human-robot interaction.
| S-EPMC5730605 | biostudies-literature

Reinforcement learning establishes a minimal metacognitive process to monitor and control motor learning performance.
| S-EPMC10329706 | biostudies-literature