首页> 外文会议>International Joint Conference on Autonomous Agents and Multiagent Systems >Message-Passing Algorithms for Large Structured Decentralized POMDPs
【24h】

Message-Passing Algorithms for Large Structured Decentralized POMDPs

机译:用于大型结构分散POMDP的消息传递算法

获取原文

摘要

Decentralized POMDPs provide a rigorous framework for multi-agent decision-theoretic planning. However, their high complexity has limited scalability. In this work, we present a promising new class of algorithms based on probabilistic inference for infinite-horizon ND-POMDPs - a restricted Dec-POMDP model. We first transform the policy optimization problem to that of likelihood maximization in a mixture of dynamic Bayes nets (DBNs). We then develop the Expectation-Maximization (EM) algorithm for maximizing the likelihood in this representation. The EM algorithm for ND-POMDPs lends itself naturally to a simple message-passing paradigm guided by the agent interaction graph. It is thus highly scalable w.r.t. the number of agents, can be easily parallelized, and produces good quality solutions.
机译:分散的POMDPS为多代理决策定理规划提供了严格的框架。然而,它们的高复杂性具有有限的可扩展性。在这项工作中,我们在无限地平线ND-POMDPS的概率推断上提出了一个有前途的新类算法 - 限制的DEC-POMDP模型。我们首先将策略优化问题转换为动态贝叶斯网(DBNS)的混合中的似然最大化。然后,我们开发期望 - 最大化(EM)算法以最大化该表示中的可能性。 ND-POMDPS的EM算法自然地引用代理交互图引导的简单消息传递范式。因此,它是高度可扩展的w.r.t.代理的数量可以很容易地,并产生良好的质量解决方案。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号