...
首页> 外文期刊>計測自動制御学会論文集 >An incremental state-space construction based on the notion of contradiction for reinforcement learning
【24h】

An incremental state-space construction based on the notion of contradiction for reinforcement learning

机译:基于加强学习矛盾矛盾的增量状态空间施工

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

In this paper, we propose an incremental state-space construction method using ART neural network in order to construct appropriate state-space for reinforcement learning. The proposed method is inspired by the notion of contradiction studied by Piagget. In this method, a state-transition table which represents the learner's states and actions is recorded. Then, if the current state transition against a certain perception is in conflict with the record, a new state for such perception is generated. We introduce two kinds of contradiction: "a contradiction such that different results are caused by the same states and the same actions" and "a contradiction due to ambiguous states" Several computer simulations on pole-balancing problem and light seeking problem for autonomous mobile robots confirm us the effectiveness of the proposed state-space construction method.
机译:在本文中,我们提出了一种使用艺术神经网络的增量状态空间施工方法,以构建适当的钢筋学习状态空间。 该方法的启发是通过Piagget研究的矛盾的概念的启发。 在此方法中,记录代表学习者状态和动作的状态转换表。 然后,如果对某一感知的当前状态转换与记录冲突,则生成用于这种感知的新状态。 我们介绍了两种矛盾:“一个矛盾,不同的结果是由同一国家和相同的行动引起的”和“暧昧状态导致的矛盾”几个计算机模拟对极衡问题和自主移动机器人的光寻求问题 确认我们所提出的状态空间施工方法的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号