Phe-Q: A Pheromone Based Q-Learning

机译：Phe-Q：基于信息素的Q学习

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Biological systems have often provided inspiration for the design of artificial systems. On such example of a natural system that has inspired researchers is the ant colony. In this paper an algorithm for multi-agent reinforcement learning, a modified Q-learning, is proposed. The algorithm is inspired by the natural behaviour of ants, which deposit pheromones in the environment to communicate. The benefit besides simulating ant behaviour in a colony is to design complex multi-agent systems. Complex behaviour can emerge from relatively simple interacting agents. The proposed Q-learning update equation includes a belief factor. The belief factor reflects the confidence the agent has in the pheromone detected in its environment. Agents communicate implicitly to co-operate in learning to solve a path-planning problem. The results indicate that combining synthetic pheromone with standard Q-learning speeds up the learning process. It will be shown that the agents can be biased towards a preferred solution by adjusting the pheromone deposit and evaporation rates.

机译：生物系统通常为人工系统的设计提供了启发。在激发研究人员灵感的这种自然系统的例子中，就是蚁群。本文提出了一种多智能体强化学习算法，一种改进的Q学习算法。该算法的灵感来自于蚂蚁的自然行为，这些蚂蚁将信息素沉积在环境中进行通信。除了模拟殖民地中的蚂蚁行为外，好处还在于设计复杂的多主体系统。相对简单的交互代理会产生复杂的行为。所提出的Q学习更新方程包括置信因子。信念因素反映了代理商对在其环境中检测到的信息素的信心。代理之间进行隐式沟通以合作学习以解决路径规划问题。结果表明，将合成信息素与标准Q学习结合起来可加快学习过程。将显示通过调节信息素沉积和蒸发速率，试剂可偏向优选溶液。

著录项

来源
《14th Australian Joint Conference on Artificial Intelligence, 14th, Dec 10-14, 2001, Adelaide, Australia》|2001年|p.345-355|共11页
会议地点 Adelaide(AU)
作者
Ndedi Monekosso; Paolo Remagnino;
展开▼
作者单位

Digital Imaging Research Centre School of Computing and Information Systems Kingston University, United Kingdom;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类自动化技术、计算机技术;
关键词
machine learning; reinforcement learning; multi-agent system;

机译：机器学习强化学习；多主体系统;

相似文献

外文文献
中文文献
专利

1. Weighted Rendezvous Planning on Q-Learning Based Adaptive Zone Partition with PSO Based Optimal Path Selection [J] . Senthil Kumar V., Prasanth K. Wireless personal communications: An Internaional Journal . 2020,第1期

机译：基于PSO的最优路径选择对Q学基于自适应区分区的加权约合规划
2. The aggregation-sex pheromones of the cerambycid beetles Anaglyptus mysticus and Xylotrechus antilope ssp. antilope: new model species for insect conservation through pheromone-based monitoring [J] . Molander Mikael A., Eriksson Bjorn, Winde Inis B., Chemoecology: An International Journal Emphasizing Evolutionary Approaches to Chemical Ecology . 2019,第3期

机译：Cherambycid甲虫的聚集性鉴定性Anaglyptus mysticus和Xulotrechus抗岩SSP。安脱石：信息素基于信息素监测的新模型物种
3. Sex pheromones and trail-following pheromone in the basal termites Zootermopsis nevadensis (Hagen) and Z-angusticollis (Hagen) (Isoptera: Termopsidae: Termopsinae) [J] . Bordereau Christian, Lacey Michael J., Semon Etienne, Biological Journal of the Linnean Society . 2010,第3期

机译：基础白蚁Zootermopsis nevadensis（Hagen）和Z-angusticollis（Hagen）（Isoptera：Termopsidae：Termopsinae）中的性信息素和尾随信息素
4. Phe-Q: A Pheromone Based Q-Learning [C] . Ndedi Monekosso, Paolo Remagnino Australian Joint Conference on Artificial Intelligence . 2001

机译：PHE-Q：基于信息素的Q-Learning
5. A Q-Learning Based Integrated Variable Speed Limit and Hard Shoulder Running Control to Reduce Travel Time at Freeway Bottleneck [D] . Zhou, Weiyi. 2019

机译：基于Q学的集成速度限制和硬肩控制，以减少高速公路瓶颈的旅行时间
6. Double Deep Q-Learning and Faster R-CNN-Based Autonomous Vehicle Navigation and Obstacle Avoidance in Dynamic Environment [O] . Razin Bin Issa, Modhumonty Das, Md. Saferi Rahman, 2021

机译：双层Q-Learning和更快的R-CNN自主车辆导航和动态环境中的避难
7. BLER-based Adaptive Q-learning for Efficient Random Access in NOMA-based mMTC Networks [O] . Duc-Dung Tran, Shree Krishna Sharma, Symeon Chatzinotas 2021

机译：基于BLER的自适应Q学习，用于基于NOMA的MMTC网络中有效随机访问

Phe-Q: A Pheromone Based Q-Learning

摘要

著录项

相似文献

相关主题

期刊订阅