首页> 外文会议>International Conference on Autonomous Agents and Multiagent Systems >Reward shaping for valuing communications during multi-agent coordination
【24h】

Reward shaping for valuing communications during multi-agent coordination

机译:奖励塑造估值在多助理协调期间的沟通

获取原文

摘要

Decentralised coordination in multi-agent systems is typically achieved using communication. However, in many cases, communication is expensive to utilise because there is limited bandwidth, it may be dangerous to communicate, or communication may simply be unavailable at times. In this context, we argue for a rational approach to communication --- if it has a cost, the agents should be able to calculate a value of communicating. By doing this, the agents can balance the need to communicate with the cost of doing so. In this research, we present a novel model of rational communication, that uses reward shaping to value communications, and employ this valuation in decentralised POMDP policy generation. In this context, reward shaping is the process by which expectations over joint actions are adjusted based on how coordinated the agent team is. An empirical evaluation of the benefits of this approach is presented in two domains. First, in the context of an idealised bench-mark problem, the multiagent Tiger problem, our method is shown to require significantly less communication (up to 30% fewer messages) and still achieves a 30% performance improvement over the current state of the art. Second, in the context of a larger-scale problem, RoboCupRescue, our method is shown to scale well, and operate without recourse to significant amounts of domain knowledge.
机译:通常使用通信实现多种代理系统中的分散协调。然而,在许多情况下,利用通信是昂贵的,因为带宽有限,通信可能是危险的,或者在次的情况下可以简单地是不可用的。在这方面,我们争论通信的合理方法 - 如果它具有成本,则代理商应该能够计算通信的价值。通过这样做,代理商可以平衡与这样做的成本进行通信的必要性。在这项研究中,我们提出了一种新颖的合理通信模型,它利用奖励塑造来重视通信,并在分散的POMDP策略生成中雇用这一估值。在这种情况下,奖励整形是根据代理团队的协调如何调整对联合行动的期望的过程。在两个域中介绍了对这种方法的益处的实证评价。首先,在理想化的工作台标记问题的上下文中,我们的方法显示,我们的方法需要显着更少的沟通(最多30%的消息),并且仍然实现了对现有技术的30%的性能改进。其次,在更大的问题的上下文中,Robocuprescue,我们的方法显示得很好,并在没有求助的情况下运行到大量的域知识。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号