...
首页> 外文期刊>計測自動制御学会論文集 >An incremental state-space construction based on the notion of contradiction for reinforcement learning
【24h】

An incremental state-space construction based on the notion of contradiction for reinforcement learning

机译:基于矛盾概念的增量状态空间构造,用于强化学习

获取原文
获取原文并翻译 | 示例
           

摘要

In this paper, we propose an incremental state-space construction method using ART neural network in order to construct appropriate state-space for reinforcement learning. The proposed method is inspired by the notion of contradiction studied by Piagget. In this method, a state-transition table which represents the learner's states and actions is recorded. Then, if the current state transition against a certain perception is in conflict with the record, a new state for such perception is generated. We introduce two kinds of contradiction: "a contradiction such that different results are caused by the same states and the same actions" and "a contradiction due to ambiguous states" Several computer simulations on pole-balancing problem and light seeking problem for autonomous mobile robots confirm us the effectiveness of the proposed state-space construction method.
机译:在本文中,我们提出了一种使用ART神经网络的增量状态空间构造方法,以构造用于增强学习的适当状态空间。提出的方法受Piagget研究的矛盾概念的启发。在这种方法中,记录了代表学习者状态和动作的状态转换表。然后,如果针对某个感知的当前状态转换与记录冲突,则会生成用于该感知的新状态。我们引入了两种矛盾:“一种矛盾,使得相同的状态和相同的动作导致不同的结果”和“由于歧义的状态而引起的矛盾”。关于自动移动机器人的极点平衡问题和寻光问题的几种计算机模拟确认我们提出的状态空间构造方法的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号