Scaling Up Decentralized MDPs Through Heuristic Search

Abstract

Decentralized partially observable Markov decision processes (Dec-POMDPs) are rich models for cooperative decision-making under uncertainty, but they are often intractable to solve optimally (NEXP-complete). The transition- and observation-independent Dec-MDP is a general subclass whose complexity has been shown to be in NP, but optimal algorithms for this subclass remain inefficient in practice. In this paper, we first provide an updated proof that an optimal policy does not depend on the histories of the agents, but only on the local observations. We then present a new algorithm, based on heuristic search, that expands search nodes using constraint optimization. We report experimental results comparing our approach with state-of-the-art Dec-MDP and Dec-POMDP solvers. These results show a reduction in computation time and an increase in scalability of multiple orders of magnitude on a number of benchmarks.
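To make the expansion step concrete, the following is a minimal, hypothetical Python sketch of best-first heuristic search in which each node expansion is posed as a small constraint-optimization problem, solved here by brute-force enumeration over joint actions. The toy two-agent problem, the admissible `upper_bound` heuristic, and all function names are assumptions for illustration only; this is not the paper's actual algorithm.

```python
# A hypothetical sketch: best-first heuristic search over partial joint plans,
# where node expansion enumerates joint actions as a tiny constraint-
# optimization step. Toy problem and names are assumptions for illustration.
import heapq
import itertools

ACTIONS = [0, 1]          # local actions available to each of two agents
HORIZON = 2               # toy planning horizon

def reward(joint_action, step):
    """Toy reward: agents are paid most for coordinating on action 1."""
    return 1.0 if joint_action == (1, 1) else 0.1 * step

def upper_bound(assigned, step):
    """Admissible heuristic: assume the best joint reward (1.0) at every
    remaining step, so the bound never underestimates the true value."""
    return sum(r for _, r in assigned) + (HORIZON - step) * 1.0

def expand_via_cop(node):
    """Expand a node by enumerating all joint actions (the variables of the
    small constraint-optimization problem) and returning each child node."""
    assigned, step = node
    children = []
    for joint in itertools.product(ACTIONS, repeat=2):
        child_assigned = assigned + [(joint, reward(joint, step))]
        children.append((child_assigned, step + 1))
    return children

def heuristic_search():
    """Best-first (A*-style) search: pop the node with the highest upper
    bound, prune against the incumbent, expand until the horizon."""
    root = ([], 0)
    frontier = [(-upper_bound(*root), 0, root)]  # max-heap via negated bound
    tie = 1                                      # unique tie-breaker for heapq
    best = None
    while frontier:
        neg_bound, _, node = heapq.heappop(frontier)
        assigned, step = node
        if step == HORIZON:                      # complete joint plan reached
            value = sum(r for _, r in assigned)
            if best is None or value > best[0]:
                best = (value, assigned)
            continue
        if best is not None and -neg_bound <= best[0]:
            continue                             # bound cannot beat incumbent
        for child in expand_via_cop(node):
            heapq.heappush(frontier, (-upper_bound(*child), tie, child))
            tie += 1
    return best

if __name__ == "__main__":
    value, plan = heuristic_search()
    print("best value:", value, "joint actions:", [j for j, _ in plan])
```

Because the heuristic is admissible, any node whose upper bound falls below the best complete plan found so far can be pruned without losing optimality; this pruning is what lets heuristic search avoid enumerating the full policy space.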
