An incremental state-space construction based on the notion of contradiction for reinforcement learning

Hisashi Handa; Akira Ninomiya; Tadashi Horiuchi; Tadataka Konishi; Mitsuru Baba


Abstract

In this paper, we propose an incremental state-space construction method that uses an ART neural network to construct an appropriate state space for reinforcement learning. The proposed method is inspired by the notion of contradiction studied by Piaget. In this method, a state-transition table representing the learner's states and actions is recorded. If the current state transition for a certain perception conflicts with the record, a new state is generated for that perception. We introduce two kinds of contradiction: "a contradiction such that different results are caused by the same states and the same actions" and "a contradiction due to ambiguous states". Several computer simulations on a pole-balancing problem and a light-seeking problem for autonomous mobile robots confirm the effectiveness of the proposed state-space construction method.
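The record-and-compare step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: a plain nearest-prototype lookup stands in for the ART neural network, only the first kind of contradiction (same state, same action, different result) is handled, and the class name `ContradictionStateSpace`, the method names, and the hashable `outcome` label are all hypothetical choices made for this example.

```python
import math
from typing import Dict, Hashable, List, Tuple

Percept = Tuple[float, ...]


class ContradictionStateSpace:
    """Hypothetical sketch: grow a discrete state space when transitions conflict."""

    def __init__(self) -> None:
        # One prototype percept per state (assumption: stands in for ART categories).
        self.prototypes: List[Percept] = []
        # State-transition table: (state, action) -> recorded outcome.
        self.table: Dict[Tuple[int, int], Hashable] = {}

    def classify(self, percept: Percept) -> int:
        """Return the index of the nearest prototype, creating state 0 if needed."""
        if not self.prototypes:
            self.prototypes.append(percept)
        return min(
            range(len(self.prototypes)),
            key=lambda i: math.dist(self.prototypes[i], percept),
        )

    def observe(self, percept: Percept, action: int, outcome: Hashable) -> int:
        """Record one transition and return the state index it was filed under."""
        state = self.classify(percept)
        key = (state, action)
        if key not in self.table:
            self.table[key] = outcome  # first record for this state and action
            return state
        if self.table[key] == outcome or percept == self.prototypes[state]:
            # Consistent with the record, or the percept cannot be told apart
            # from the existing prototype, so no new state is generated.
            return state
        # Contradiction: same state and action, different result.
        # Generate a new state for this perception and record its transition.
        self.prototypes.append(percept)
        new_state = len(self.prototypes) - 1
        self.table[(new_state, action)] = outcome
        return new_state
```

Starting from a single coarse state, a caller would feed each observed `(percept, action, outcome)` triple to `observe`; the state space then grows only where the recorded table and the new experience disagree.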

Bibliographic details

Source
計測自動制御学会論文集 | 2002, No. 5 | 8 pages
Authors
Hisashi Handa; Akira Ninomiya; Tadashi Horiuchi; Tadataka Konishi; Mitsuru Baba
Author affiliations

Project Leader Animal Health and Welfare Cheshire County Council;

Indexing information
Original format: PDF
Language of text: Japanese (jpn)
Chinese Library Classification: automation components and parts
Keywords
Incremental state-space construction; Reinforcement learning; ART neural network; Notion of contradiction

Similar literature

1. An incremental state-space construction based on the notion of contradiction for reinforcement learning [J]. Hisashi Handa, Akira Ninomiya, Tadashi Horiuchi. 計測自動制御学会論文集. 2002, No. 5
2. Autonomous construction hoist system based on deep reinforcement learning in high-rise building construction [J]. Lee Dongmin, Kim Minhoe. Automation in Construction. 2021, Aug. issue
3. Reinforcement learning-based intelligent energy management architecture for hybrid construction machinery [J]. Zhang Wei, Wang Jixin, Liu Yong. Applied Energy. 2020, Oct. issue
4. Characteristic of temperature-based reinforcement learning in learning-parameters - characteristic of convergence of learning and construction of state-space - [C]. Tsutomu Sawada, Atsushi Sugai, Sumiaki Ichikawa. 日本ロボット学会学術講演会. 2000
5. Examining the performance of population-based incremental learning and island model population-based incremental learning on a GA-hard problem with a very large search space [D]. Brownlee, Benjamin Richard. 2010
6. Regulating recognition decisions through incremental reinforcement learning [O]. Sanghoon Han, Ian G. Dobbins
7. Construction of Behavioral Concepts through Social Interactions based on Reward Design: Schema-Based Incremental Reinforcement Learning [O]. Tadahiro Taniguchi, Tetsuo Sawaragi. 2006
