Dataset Information

Deterministic response strategies in a trial-and-error learning task.

ABSTRACT: Trial-and-error learning is a universal strategy for establishing which actions are beneficial or harmful in new environments. However, learning stimulus-response associations solely via trial-and-error is often suboptimal, as in many settings dependencies among stimuli and responses can be exploited to increase learning efficiency. Previous studies have shown that in settings featuring such dependencies, humans typically engage high-level cognitive processes and employ advanced learning strategies to improve their learning efficiency. Here we analyze in detail the initial learning phase of a sample of human subjects (N = 85) performing a trial-and-error learning task with deterministic feedback and hidden stimulus-response dependencies. Using computational modeling, we find that the standard Q-learning model cannot sufficiently explain human learning strategies in this setting. Instead, newly introduced deterministic response models, which are theoretically optimal and transform stimulus sequences unambiguously into response sequences, provide the best explanation for 50.6% of the subjects. Most of the remaining subjects either show a tendency towards generic optimal learning (21.2%) or at least partially exploit stimulus-response dependencies (22.3%), while a few subjects (5.9%) show no clear preference for any of the employed models. After the initial learning phase, asymptotic learning performance during the subsequent practice phase is best explained by the standard Q-learning model. Our results show that human learning strategies in the presented trial-and-error learning task go beyond merely associating stimuli and responses via incremental reinforcement. Specifically during initial learning, high-level cognitive processes support sophisticated learning strategies that increase learning efficiency while keeping memory demands and computational efforts bounded. The good asymptotic fit of the Q-learning model indicates that these cognitive processes are successively replaced by the formation of stimulus-response associations over the course of learning.

SUBMITTER: Mohr H

PROVIDER: S-EPMC6289466 | biostudies-literature | 2018 Nov

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

Deterministic response strategies in a trial-and-error learning task.

Mohr Holger H Zwosta Katharina K Markovic Dimitrije D Bitzer Sebastian S Wolfensteller Uta U Ruge Hannes H

PLoS computational biology 20181129 11

Trial-and-error learning is a universal strategy for establishing which actions are beneficial or harmful in new environments. However, learning stimulus-response associations solely via trial-and-error is often suboptimal, as in many settings dependencies among stimuli and responses can be exploited to increase learning efficiency. Previous studies have shown that in settings featuring such dependencies, humans typically engage high-level cognitive processes and employ advanced learning strateg ...[more]

PMID: 30496285

Dataset Information

Deterministic response strategies in a trial-and-error learning task.

Publications

Deterministic response strategies in a trial-and-error learning task.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

Interactions between sensory prediction error and task error during implicit motor learning.
| S-EPMC8979451 | biostudies-literature

Dynamics of error-related activity in deterministic learning - an EEG and fMRI study.
| S-EPMC6168565 | biostudies-literature

Multi-task learning uncovers robust translation cis-regulatory features
2022-04-28 | GSE201766 | GEO

Reward abundance interferes with error-based learning in a visuomotor adaptation task.
| S-EPMC5841744 | biostudies-literature

Delays without mistakes: response time and error distributions in dual-task.
| S-EPMC2527526 | biostudies-literature

Cognitive strategies regulate fictive, but not reward prediction error signals in a sequential investment task.
| S-EPMC4105325 | biostudies-literature

Deterministic error correction for nonlocal spatial-polarization hyperentanglement.
| S-EPMC4748264 | biostudies-other

Mice adaptively generate choice variability in a deterministic task.
| S-EPMC6972896 | biostudies-literature

Measurement protocols, random-variable-valued measurements, and response process error: Estimation and inference when sample data are not deterministic.
| S-EPMC7529193 | biostudies-literature

Exploring the fundamental dynamics of error-based motor learning using a stationary predictive-saccade task.
| S-EPMC3179473 | biostudies-literature