Unknown

Dataset Information

0

The gradient of the reinforcement landscape influences sensorimotor learning.


ABSTRACT: Consideration of previous successes and failures is essential to mastering a motor skill. Much of what we know about how humans and animals learn from such reinforcement feedback comes from experiments that involve sampling from a small number of discrete actions. Yet, it is less understood how we learn through reinforcement feedback when sampling from a continuous set of possible actions. Navigating a continuous set of possible actions likely requires using gradient information to maximize success. Here we addressed how humans adapt the aim of their hand when experiencing reinforcement feedback that was associated with a continuous set of possible actions. Specifically, we manipulated the change in the probability of reward given a change in motor action-the reinforcement gradient-to study its influence on learning. We found that participants learned faster when exposed to a steep gradient compared to a shallow gradient. Further, when initially positioned between a steep and a shallow gradient that rose in opposite directions, participants were more likely to ascend the steep gradient. We introduce a model that captures our results and several features of motor learning. Taken together, our work suggests that the sensorimotor system relies on temporally recent and spatially local gradient information to drive learning.

SUBMITTER: Cashaback JGA 

PROVIDER: S-EPMC6417747 | biostudies-literature | 2019 Mar

REPOSITORIES: biostudies-literature

altmetric image

Publications

The gradient of the reinforcement landscape influences sensorimotor learning.

Cashaback Joshua G A JGA   Lao Christopher K CK   Palidis Dimitrios J DJ   Coltman Susan K SK   McGregor Heather R HR   Gribble Paul L PL  

PLoS computational biology 20190304 3


Consideration of previous successes and failures is essential to mastering a motor skill. Much of what we know about how humans and animals learn from such reinforcement feedback comes from experiments that involve sampling from a small number of discrete actions. Yet, it is less understood how we learn through reinforcement feedback when sampling from a continuous set of possible actions. Navigating a continuous set of possible actions likely requires using gradient information to maximize succ  ...[more]

Similar Datasets

| S-EPMC5550011 | biostudies-other
| S-EPMC6264158 | biostudies-literature
| S-EPMC6491497 | biostudies-literature
| S-EPMC3166306 | biostudies-literature
| S-EPMC4640749 | biostudies-other
| S-EPMC5487354 | biostudies-literature
| S-EPMC4760114 | biostudies-literature
| S-EPMC3247813 | biostudies-other
| S-EPMC4909278 | biostudies-literature
| S-EPMC7482564 | biostudies-literature