An incremental state-space construction based on the notion of contradiction for reinforcement learning

Hisashi Handa; Akira Ninomiya; Tadashi Horiuchi; Tadataka Konishi; Mitsuru Baba

首页> 外文期刊>計測自動制御学会論文集 >An incremental state-space construction based on the notion of contradiction for reinforcement learning

【24h】

An incremental state-space construction based on the notion of contradiction for reinforcement learning

机译：基于矛盾概念的增量状态空间构造，用于强化学习

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper, we propose an incremental state-space construction method using ART neural network in order to construct appropriate state-space for reinforcement learning. The proposed method is inspired by the notion of contradiction studied by Piagget. In this method, a state-transition table which represents the learner's states and actions is recorded. Then, if the current state transition against a certain perception is in conflict with the record, a new state for such perception is generated. We introduce two kinds of contradiction: "a contradiction such that different results are caused by the same states and the same actions" and "a contradiction due to ambiguous states" Several computer simulations on pole-balancing problem and light seeking problem for autonomous mobile robots confirm us the effectiveness of the proposed state-space construction method.

机译：在本文中，我们提出了一种使用ART神经网络的增量状态空间构造方法，以构造用于增强学习的适当状态空间。提出的方法受Piagget研究的矛盾概念的启发。在这种方法中，记录了代表学习者状态和动作的状态转换表。然后，如果针对某个感知的当前状态转换与记录冲突，则会生成用于该感知的新状态。我们引入了两种矛盾：“一种矛盾，使得相同的状态和相同的动作导致不同的结果”和“由于歧义的状态而引起的矛盾”。关于自动移动机器人的极点平衡问题和寻光问题的几种计算机模拟确认我们提出的状态空间构造方法的有效性。

著录项

来源
《計測自動制御学会論文集》 |2002年第5期|共8页
作者
Hisashi Handa; Akira Ninomiya; Tadashi Horiuchi; Tadataka Konishi; Mitsuru Baba;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 jpn
中图分类自动化元件、部件;
关键词
Incremental state-space construction; Reinforcement learning; ART neural network notion of contradiction;

机译：增量状态空间构造;强化学习;ART神经网络矛盾观;

相似文献

外文文献
中文文献
专利

1. An incremental state-space construction based on the notion of contradiction for reinforcement learning [J] . Hisashi Handa, Akira Ninomiya, Tadashi Horiuchi, 計測自動制御学会論文集 . 2002,第5期

机译：基于矛盾概念的增量状态空间构造，用于强化学习
2. Autonomous construction hoist system based on deep reinforcement learning in high-rise building construction [J] . Lee Dongmin, Kim Minhoe Automation in construction . 2021,第Auga期

机译：基于深层建筑施工深增强学习的自主建筑葫芦系统
3. Reinforcement learning-based intelligent energy management architecture for hybrid construction machinery [J] . Zhang Wei, Wang Jixin, Liu Yong, Applied Energy . 2020,第Octa1期

机译：用于混合施工机械的加固基于学习的智能能量管理架构
4. Characteristic of temperature-based reinforcement learning in learning-parameters - characteristic of convergence of learning and construction of state-space - [C] . Tsutomu Sawada, Atsushi Sugai, Sumiaki Ichikawa, 日本ロボット学会学術講演会 . 2000

机译：基于温度的增强学习参数的特征 - 学习趋同特征及状态空间施工 -
5. Examining the performance of population-based incremental learning and island model population-based incremental learning on a GA-hard problem with a very large search space. [D] . Brownlee, Benjamin Richard. 2010

机译：检查具有很大搜索空间的GA难题的基于人口的增量学习和基于岛模型的基于人口的增量学习的性能。
6. Regulating recognition decisions through incremental reinforcement learning [O] . Sanghoon Han, Ian G. Dobbins -1

机译：通过增量加强学习来调节识别决策
7. Construction of Behavioral Concepts through Social Interactions based on Reward Design: Schema-Based Incremental Reinforcement Learning [O] . Tadahiro TANIGUCHI, Tetsuo SAWARAGI 2006

机译：基于奖励设计的社会互动构建行为概念：基于架构的增量强化学习

An incremental state-space construction based on the notion of contradiction for reinforcement learning

摘要

著录项

相似文献

相关主题

期刊订阅