首页> 外文会议>IEEE International Conference on Fuzzy Systems >Differential Reinforcement-type Shaping Q-Learning Method Based on Animal Training for Autonomous Mobile Robot

【24h】

Differential Reinforcement-type Shaping Q-Learning Method Based on Animal Training for Autonomous Mobile Robot

机译：基于动物训练的自主移动机器人差动加固型Q学习方法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Recently, the general idea of "shaping" used by ethology, behavior analysis or animal training is a remarkable method. "Shaping" is a general idea that the learner is given a reinforcement signal step by step gradually and inductively forward the behavior from easy tasks to complicated tasks. In this paper, we propose a shaping reinforcement learning method took in a general idea of Shaping to the reinforcement learning that can acquire a desired behavior by the repeated search autonomously. Three different shaping reinforcement learning methods used Q-Learning, Profit Sharing, and Actor-Critic to check the efficiency of the Shaping were proposed at first. Furthermore, we proposed the Differential Reinforcement-type Shaping Q-Learning (DR-SQL) applied a general idea of "differential reinforcement" to reinforce a special behavior step by step such as real animal training, and confirmed the effectiveness of these methods by the simulation experiment of grid search problem.

机译：最近，道德学使用的“塑造”的一般思想，行为分析或动物培训是一种显着的方法。 “塑造”是一般的想法，即学习者逐步逐步逐步赋予加强信号，并从简单的任务到复杂任务的行为。在本文中，我们提出了一种成型加强学习方法，其综合思想塑造到增强学习，可以通过重复搜索自主获取所需的行为。三种不同的成型强化学习方法使用Q-Learning，利润分享和演员 - 批评者首先提出了塑造的效率。此外，我们提出了差动加强型整形Q-Learning（DR-SQL）应用了“差动增强”的一般思想，以通过诸如真实的动物训练等步骤加强特殊行为，并确认了这些方法的有效性网格搜索问题的仿真实验。

著录项

来源
《IEEE International Conference on Fuzzy Systems 》|2008年||共6页
会议地点
作者
Yoichiro Maeda; Satoshi Hanaka;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP273.4-53;
关键词

相似文献

外文文献
中文文献
专利

1. AN ENVIRONMENTAL VISUAL FEATURES BASED NAVIGATION METHOD FOR AUTONOMOUS MOBILE ROBOTS [J] . Fairul Azni Jafar, Yasunori Suzuki, Yuki Tateno, International Journal of Innovative Computing Information and Control . 2011 ,第3期

机译：基于环境视觉特征的自主机器人导航方法
2. A positional information sharing method based on "Figure Of Confidence" for a herd of autonomous mobile robots [J] . Kobayashi Hiroyuki, Matsuo Yoshiki, Makino Koji 電気学会論文誌. C . 2001 ,第8期

机译：基于“信心图”的一群自主移动机器人位置信息共享方法
3. A Reinforcement Learning Method for Dynamic Behavior Arbitration of Autonomous Mobile Robots Based on the Immunological Information Processing Mechanisms [J] . Akio Ishiguro, Toshiyuki Kondo, Yuji Watanabe 電気学会論文誌. C . 1997 ,第1期

机译：基于免疫信息处理机制的自主移动机器人动态行为仲裁强化学习方法
4. Differential Reinforcement-type Shaping Q-Learning Method Based on Animal Training for Autonomous Mobile Robot [C] . Yoichiro Maeda, Satoshi Hanaka IEEE International Conference on Fuzzy Systems . 2008

机译：基于动物训练的自主移动机器人差动加固型Q学习方法
5. Motion Planning and Control of Autonomous Mobile Robots: Model Based and Model Free Methods [D] . Fahad, Muhammad . 2019

机译：自主移动机器人的运动规划与控制：基于模型和模型免费方法
6. A New Positioning Method Based on Multiple Ultrasonic Sensors for Autonomous Mobile Robot [O] . Mingqi Shen, Yuying Wang, Yandan Jiang, 2020

机译：基于多个超声波传感器的自主移动机器人定位新方法
7. A Bio-inspired Autonomous Navigation Controller for Differential Mobile Robots Based on Crowd Dynamics [O] . Alejandro Rodriguez-Angeles, Henk Nijmeijer, Fransis J. M. van Kuijk 2016

机译：基于人群动态的差分移动机器人生物启发自主导航控制器

Differential Reinforcement-type Shaping Q-Learning Method Based on Animal Training for Autonomous Mobile Robot

摘要

著录项

相似文献

相关主题

期刊订阅