首页> 外文会议>International Conference on Automated Planning and Scheduling(ICAPS 2007); 2007; >Mixed Integer Linear Programming For Exact Finite-Horizon Planning In Decentralized Pomdps
【24h】

Mixed Integer Linear Programming For Exact Finite-Horizon Planning In Decentralized Pomdps

机译:混合整数线性规划用于分散Pomdps中的精确有限时间规划

获取原文
获取原文并翻译 | 示例

摘要

We consider the problem of finding an n-agent joint-policy for the optimal finite-horizon control of a decentralized Pomdp (Dec-Pomdp). This is a problem of very high complexity (NEXP-hard in n ≥ 2). In this paper, we propose a new mathematical programming approach for the problem. Our approach is based on two ideas: First, we represent each agent's policy in the sequence-form and not in the tree-form, thereby obtaining a very compact representation of the set of joint-policies. Second, using this compact representation, we solve this problem as an instance of combinatorial optimization for which we formulate a mixed integer linear program (MILP). The optimal solution of the MILP directly yields an optimal joint-policy for the Dec-Pomdp. Computational experience shows that formulating and solving the MILP requires significantly less time to solve benchmark Dec-Pomdp problems than existing algorithms. For example, the multi-agent tiger problem for horizon 4 is solved in 72 sees with the MILP whereas existing algorithms require several hours to solve it.
机译:我们考虑为分散Pomdp(Dec-Pomdp)的最佳有限水平控制寻找n-agent联合策略的问题。这是一个非常高的复杂性问题(n≥2的NEXP-hard)。在本文中,我们针对该问题提出了一种新的数学编程方法。我们的方法基于两个思想:首先,我们以顺序形式而非树形形式表示每个代理的策略,从而获得联合策略集合的非常紧凑的表示形式。其次,使用这种紧凑的表示形式,我们将解决此问题作为组合优化的一个实例,为此我们制定了一个混合整数线性程序(MILP)。 MILP的最佳解决方案直接为Dec-Pomdp产生最佳的联合策略。计算经验表明,与现有算法相比,制定和解决MILP需要花费更少的时间来解决基准Dec-Pomdp问题。例如,使用MILP在72眼中解决了地平线4的多主体Tiger问题,而现有算法需要几个小时才能解决。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号