Conference: IEEE International Conference on Control and Automation

Efficient, swarm-based path finding in unknown graphs using reinforcement learning

Abstract

This paper addresses the problem of steering a swarm of autonomous agents out of an unknown maze to a goal at an unknown location. The setting of particular interest is one in which no direct communication between agents is possible, so all information exchange has to occur indirectly through information “deposited” in the environment. To address this task, the paper introduces an ε-greedy collaborative reinforcement learning method that uses only local information exchanges to balance exploitation and exploration in the unknown maze and to optimize the ability of the swarm to exit the maze. The learning and routing algorithm provides a mechanism for storing the data needed to represent a collaborative utility function based on the experiences of agents that previously visited a node, so that routing decisions improve over time. Two theorems show the theoretical soundness of the proposed learning method and illustrate the importance of the stored information in improving routing decisions. Simulation examples show that these simple rules for learning from past experience significantly improve performance over both random search and search based on Ant Colony Optimization, a metaheuristic algorithm.
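
The abstract does not give the update rule, so the following Python sketch only illustrates the general idea of ε-greedy routing with values stored at graph nodes. It is not the paper's algorithm: the standard tabular Q-learning update is swapped in for the collaborative utility function, and the names `epsilon_greedy_step`, `run_swarm`, `stored`, `alpha`, and `gamma`, the reward of 1 on reaching the goal, and the undirected example graph are all assumptions made for illustration. Agents never communicate directly; each one reads the values deposited at its current node and updates the entry for the edge it just traversed.

```python
import random

def epsilon_greedy_step(edge_values, epsilon, rng):
    """Choose a neighbor: explore with probability epsilon, else exploit.

    edge_values maps each neighbor to the value stored at the current node.
    Ties among the best neighbors are broken at random.
    """
    neighbors = list(edge_values)
    if rng.random() < epsilon:
        return rng.choice(neighbors)
    best = max(edge_values.values())
    return rng.choice([n for n in neighbors if edge_values[n] == best])

def run_swarm(graph, start, goal, n_agents=50, epsilon=0.1,
              alpha=0.5, gamma=0.9, max_steps=1000, seed=0):
    """Send agents through the graph one after another.

    Assumes an undirected graph given as {node: [neighbors]}, so every
    node has at least one neighbor. Returns the stored values and the
    number of steps each agent took (capped at max_steps).
    """
    rng = random.Random(seed)
    # stored[u][v] is the value "deposited" at node u for the edge u -> v.
    stored = {u: {v: 0.0 for v in nbrs} for u, nbrs in graph.items()}
    steps_per_agent = []
    for _ in range(n_agents):
        node, steps = start, 0
        while node != goal and steps < max_steps:
            nxt = epsilon_greedy_step(stored[node], epsilon, rng)
            reward = 1.0 if nxt == goal else 0.0
            # Assumed update: standard Q-learning backup using only the
            # values stored at the node the agent has just reached.
            future = 0.0 if nxt == goal else max(stored[nxt].values(), default=0.0)
            stored[node][nxt] += alpha * (reward + gamma * future - stored[node][nxt])
            node = nxt
            steps += 1
        steps_per_agent.append(steps)
    return stored, steps_per_agent

if __name__ == "__main__":
    # Hypothetical five-node maze: A is the entrance, G the goal, C a dead end.
    maze = {"A": ["B"], "B": ["A", "C", "D"], "C": ["B"], "D": ["B", "G"], "G": ["D"]}
    values, steps = run_swarm(maze, start="A", goal="G")
    print("steps for first five agents:", steps[:5])
    print("steps for last five agents:", steps[-5:])
```

Comparing the step counts of early and late agents gives a rough picture of whether the deposited values are helping. Setting `epsilon=1.0` turns the same loop into a pure random search, which can serve as a baseline.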