【24h】

Phe-Q: A Pheromone Based Q-Learning

机译:Phe-Q:基于信息素的Q学习

获取原文
获取原文并翻译 | 示例

摘要

Biological systems have often provided inspiration for the design of artificial systems. On such example of a natural system that has inspired researchers is the ant colony. In this paper an algorithm for multi-agent reinforcement learning, a modified Q-learning, is proposed. The algorithm is inspired by the natural behaviour of ants, which deposit pheromones in the environment to communicate. The benefit besides simulating ant behaviour in a colony is to design complex multi-agent systems. Complex behaviour can emerge from relatively simple interacting agents. The proposed Q-learning update equation includes a belief factor. The belief factor reflects the confidence the agent has in the pheromone detected in its environment. Agents communicate implicitly to co-operate in learning to solve a path-planning problem. The results indicate that combining synthetic pheromone with standard Q-learning speeds up the learning process. It will be shown that the agents can be biased towards a preferred solution by adjusting the pheromone deposit and evaporation rates.
机译:生物系统通常为人工系统的设计提供了启发。在激发研究人员灵感的这种自然系统的例子中,就是蚁群。本文提出了一种多智能体强化学习算法,一种改进的Q学习算法。该算法的灵感来自于蚂蚁的自然行为,这些蚂蚁将信息素沉积在环境中进行通信。除了模拟殖民地中的蚂蚁行为外,好处还在于设计复杂的多主体系统。相对简单的交互代理会产生复杂的行为。所提出的Q学习更新方程包括置信因子。信念因素反映了代理商对在其环境中检测到的信息素的信心。代理之间进行隐式沟通以合作学习以解决路径规划问题。结果表明,将合成信息素与标准Q学习结合起来可加快学习过程。将显示通过调节信息素沉积和蒸发速率,试剂可偏向优选溶液。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号