Unknown

Dataset Information

0

Parallel reinforcement learning for weighted multi-criteria model with adaptive margin.


ABSTRACT: Reinforcement learning (RL) for a linear family of tasks is described in this paper. The key of our discussion is nonlinearity of the optimal solution even if the task family is linear; we cannot obtain the optimal policy using a naive approach. Although an algorithm exists for calculating the equivalent result to Q-learning for each task simultaneously, it presents the problem of explosion of set sizes. We therefore introduce adaptive margins to overcome this difficulty.

SUBMITTER: Hiraoka K 

PROVIDER: S-EPMC2645492 | biostudies-other | 2009 Mar

REPOSITORIES: biostudies-other

Similar Datasets

| S-EPMC7508815 | biostudies-literature
| S-EPMC7308406 | biostudies-literature
| S-EPMC6739057 | biostudies-literature
| S-EPMC7731977 | biostudies-literature
| S-EPMC3867158 | biostudies-literature
| S-EPMC10229550 | biostudies-literature
| S-EPMC5500327 | biostudies-literature
| S-EPMC9805575 | biostudies-literature
| S-EPMC9298337 | biostudies-literature
| S-EPMC5312784 | biostudies-literature