Network Distributed POMDP with Communication


Abstract

While Distributed POMDPs have become popular for modeling multiagent systems in uncertain domains, it is the Network Distributed POMDP (ND-POMDP) model that has begun to scale up the number of agents. ND-POMDPs can exploit the locality of agents' interactions. However, prior work on ND-POMDPs has failed to address communication. Without communication, the size of each agent's local policy in an ND-POMDP grows exponentially in the time horizon. To overcome this problem, we extend existing algorithms so that agents periodically communicate their observation and action histories to each other. After communication, agents can start from a new synchronized belief state, avoiding the exponential growth in the size of their local policies. Furthermore, we introduce an idea similar to the Point-based Value Iteration algorithm that approximates the value function with a fixed number of representative points. Our experimental results show that we can obtain much longer policies than existing algorithms as long as the interval between communications is small.
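The core idea in the abstract can be sketched in a few lines of Python. This is a minimal illustration with hypothetical names and a toy two-state model, not the authors' algorithm: agents track local observation-action histories, and every `interval` steps they exchange those histories and compute a shared synchronized belief by Bayesian filtering, after which local histories are cleared. Local policy trees therefore only ever need depth `interval`, not the full horizon.

```python
def belief_update(belief, trans, obs_model, joint_action, joint_obs):
    """One step of Bayesian filtering over hidden states.

    belief:    {state: probability}
    trans:     {(s, joint_action, s'): P(s' | s, a)}
    obs_model: {(s', joint_action, o): P(o | s', a)}
    """
    new_belief = {}
    for s2 in belief:
        pred = sum(belief[s1] * trans[(s1, joint_action, s2)] for s1 in belief)
        new_belief[s2] = pred * obs_model[(s2, joint_action, joint_obs)]
    z = sum(new_belief.values()) or 1.0
    return {s: p / z for s, p in new_belief.items()}


def run_with_sync(init_belief, trans, obs_model, episode, interval):
    """Replay an episode of (joint_action, joint_obs) pairs, exchanging
    histories and resynchronizing the belief every `interval` steps."""
    belief = dict(init_belief)
    local_history = []   # what an agent must remember between syncs
    longest = 0
    for t, (a, o) in enumerate(episode, 1):
        local_history.append((a, o))
        longest = max(longest, len(local_history))
        belief = belief_update(belief, trans, obs_model, a, o)
        if t % interval == 0:
            # Histories have been exchanged: the filtered belief is now
            # common knowledge, so local memory can be reset.
            local_history.clear()
    return belief, longest


# Toy 2-state, 1-action model: states tend to persist (0.9),
# observations match the true state with probability 0.8.
init = {'s0': 0.5, 's1': 0.5}
trans = {('s0', 'a', 's0'): 0.9, ('s0', 'a', 's1'): 0.1,
         ('s1', 'a', 's1'): 0.9, ('s1', 'a', 's0'): 0.1}
obs_model = {('s0', 'a', 'o0'): 0.8, ('s0', 'a', 'o1'): 0.2,
             ('s1', 'a', 'o0'): 0.2, ('s1', 'a', 'o1'): 0.8}
episode = [('a', 'o0')] * 6
belief, longest = run_with_sync(init, trans, obs_model, episode, interval=3)
```

With a horizon of 6 and a communication interval of 3, each agent's remembered history never exceeds 3 steps, which is the source of the claimed savings over horizon-length policy trees.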
