首页> 外文会议>International Conference on Artificial Intelligence: Technologies and Applications >Adaptively Finding Optimal Routes under Principles of Spatial Cognition: A Hierarchical Reinforcement Learning Approach
【24h】

Adaptively Finding Optimal Routes under Principles of Spatial Cognition: A Hierarchical Reinforcement Learning Approach

机译:自适应地发现空间认知原则下的最佳路线:分层加强学习方法

获取原文

摘要

Way finding research has paid much attention to the selection of optimal routes under principles of spatial cognition. However, the commonly employed implemental approaches suffer inevitably from the contradictions between personalized network modelling and network data sharing. This paper presents one kind of interactive route selection approach based on hierarchical reinforcement learning. In this approach, a complete network model is unnecessary, but the environmental states are automatically perceived by the agent and then mapped into the reward function defining the goal of cognitively optimal routes. The optimal routes corresponding to the policies with maximal cumulative rewards can be found through a two-stage learning process including a pre-learning stage and a real-time learning one. Our experimental results show that the proposed approach learns effectively enough for real-time route selection and ensures found routes close to global optimal ones.
机译:寻找研究的方式关注在空间认知原则下选择最佳路线。然而,通常采用的实施方法不可避免地遭受个性化网络建模和网络数据共享之间的矛盾。本文介绍了一种基于等级加固学习的互动路径选择方法。在这种方法中,不需要一个完整的网络模型,但是,该州的环境态被代理自动感知,然后映射到奖励函数,定义认知最佳路线的目标。通过两个阶段学习过程可以找到与具有最大累积奖励的策略相对应的最佳路线,包括预学习阶段和实时学习。我们的实验结果表明,该方法有效地学习足以进行实时路线选择,并确保找到靠近全局最优的路线。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号