Conference: IEEE International Conference on Fuzzy Systems

Differential Reinforcement-type Shaping Q-Learning Method Based on Animal Training for Autonomous Mobile Robot



Abstract

Recently, the idea of "shaping," used in ethology, behavior analysis, and animal training, has drawn attention as a noteworthy method. In shaping, the learner is given reinforcement signals step by step, so that its behavior is guided gradually and inductively from easy tasks to complicated ones. In this paper, we propose shaping reinforcement learning methods that bring the idea of shaping into reinforcement learning, which can acquire a desired behavior autonomously through repeated search. We first proposed three shaping reinforcement learning methods, based on Q-Learning, Profit Sharing, and Actor-Critic, to check the efficiency of shaping. Furthermore, we proposed Differential Reinforcement-type Shaping Q-Learning (DR-SQL), which applies the idea of "differential reinforcement" to reinforce a specific behavior step by step, as in real animal training, and we confirmed the effectiveness of these methods through simulation experiments on a grid search problem.
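The abstract does not give the DR-SQL update rule, so the following is only a minimal sketch of how shaping might be combined with tabular Q-learning on a grid search task. It assumes one possible reading of "differential reinforcement": the reward criterion is tightened stage by stage, and behavior that met an earlier, looser criterion is no longer reinforced. The grid size, the distance thresholds, the learning parameters, and the names `train_shaped_q` and `greedy_path` are all illustrative assumptions, not taken from the paper.

```python
import random

ACTIONS = [(0, 1), (0, -1), (1, 0), (-1, 0)]  # right, left, down, up


def move(state, action, size):
    """Deterministic grid move; bumping into a wall leaves the agent in place."""
    dr, dc = ACTIONS[action]
    row = min(max(state[0] + dr, 0), size - 1)
    col = min(max(state[1] + dc, 0), size - 1)
    return (row, col)


def train_shaped_q(size=5, thresholds=(4, 2, 0), episodes_per_stage=1000,
                   alpha=0.5, gamma=0.9, epsilon=0.2, max_steps=100, seed=0):
    """Tabular Q-learning with a staged (shaped) reward criterion.

    Assumption: in each stage the agent is rewarded only when it comes within
    `threshold` Manhattan steps of the goal corner, and the threshold shrinks
    from stage to stage until only the goal itself is reinforced.
    """
    rng = random.Random(seed)
    goal = (size - 1, size - 1)
    n_actions = len(ACTIONS)
    q = {((r, c), a): 0.0
         for r in range(size) for c in range(size) for a in range(n_actions)}

    def best_value(state):
        return max(q[(state, a)] for a in range(n_actions))

    for threshold in thresholds:  # easy criterion first, strict criterion last
        for _ in range(episodes_per_stage):
            state = (0, 0)
            for _ in range(max_steps):
                if rng.random() < epsilon:  # explore
                    action = rng.randrange(n_actions)
                else:  # exploit, breaking ties at random
                    top = best_value(state)
                    action = rng.choice(
                        [a for a in range(n_actions) if q[(state, a)] == top])
                nxt = move(state, action, size)
                distance = abs(goal[0] - nxt[0]) + abs(goal[1] - nxt[1])
                met = distance <= threshold  # current shaping criterion
                # Standard Q-learning target; the episode ends once the
                # criterion is met, so no bootstrap term in that case.
                target = 1.0 if met else gamma * best_value(nxt)
                q[(state, action)] += alpha * (target - q[(state, action)])
                if met:
                    break
                state = nxt
    return q


def greedy_path(q, size=5, max_steps=50):
    """Follow the greedy policy from the start corner for at most max_steps."""
    state, goal = (0, 0), (size - 1, size - 1)
    path = [state]
    for _ in range(max_steps):
        if state == goal:
            break
        action = max(range(len(ACTIONS)), key=lambda a: q[(state, a)])
        state = move(state, action, size)
        path.append(state)
    return path


if __name__ == "__main__":
    q_table = train_shaped_q()
    print(greedy_path(q_table))
```

In this sketch the Q-table is carried over between stages, so values learned under a loose criterion steer the agent toward the goal region, and they are then revised downward once that criterion stops paying out. Whether this matches the paper's mechanism cannot be determined from the abstract alone.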
