Journal: Web Intelligence and Agent Systems

Introducing communication in Dis-POMDPs with locality of interaction



Abstract

Networked Distributed POMDPs (ND-POMDPs) can model multiagent systems in uncertain domains and have begun to scale up the number of agents. However, prior work on ND-POMDPs has not addressed communication. Without communication, the size of each agent's local policy in an ND-POMDP grows exponentially with the time horizon. To overcome this problem, we extend existing algorithms so that agents periodically communicate their observation and action histories to each other. After communication, agents can start from a new synchronized belief state, which avoids the exponential growth in the size of local policies. Furthermore, we introduce an idea similar to the Point-based Value Iteration algorithm to approximate the value function with a fixed number of representative points. Our experimental results show that we can obtain much longer policies than existing algorithms as long as the interval between communications is small.
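
The growth argument in the abstract can be made concrete with a rough count. The sketch below is not the paper's algorithm. It is a back-of-the-envelope illustration under assumptions that are not in the source: a deterministic local policy is represented as a tree with one decision node per observation history, communication happens every `comm_interval` steps, and after each communication the policy restarts from one of at most `num_belief_points` representative synchronized beliefs. The function and parameter names are hypothetical.

```python
def policy_tree_nodes(num_observations: int, horizon: int) -> int:
    """Count decision nodes in a policy tree of the given horizon.

    Assumption: one node per observation history of length 0..horizon-1,
    so the count is a geometric sum that grows exponentially in horizon.
    """
    if horizon <= 0:
        return 0
    return sum(num_observations ** t for t in range(horizon))


def segmented_policy_nodes(
    num_observations: int,
    horizon: int,
    comm_interval: int,
    num_belief_points: int,
) -> int:
    """Upper-bound the policy size when agents synchronize periodically.

    Assumption: each segment between communications needs one short
    policy tree per representative belief point, so the total grows
    linearly in the horizon for a fixed communication interval.
    """
    if comm_interval <= 0:
        raise ValueError("comm_interval must be positive")
    full_segments, remainder = divmod(horizon, comm_interval)
    per_full_segment = policy_tree_nodes(num_observations, comm_interval)
    per_last_segment = policy_tree_nodes(num_observations, remainder)
    return num_belief_points * (
        full_segments * per_full_segment + per_last_segment
    )


if __name__ == "__main__":
    # Two observations, horizon 10, no communication.
    print(policy_tree_nodes(2, 10))             # 1023
    # Same setting, communicating every 2 steps with 5 belief points.
    print(segmented_policy_nodes(2, 10, 2, 5))  # 75
```

Under these assumptions the segmented count depends exponentially only on the communication interval, which is consistent with the abstract's remark that long policies are obtainable as long as that interval is small.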
