Optimizing fixed-size stochastic controllers for POMDPs and decentralized POMDPs

Christopher Amato; Daniel S. Bernstein; Shlomo Zilberstein


Abstract

POMDPs and their decentralized multiagent counterparts, DEC-POMDPs, offer a rich framework for sequential decision making under uncertainty. Their high computational complexity, however, presents an important research challenge. One way to address the intractable memory requirements of current algorithms is based on representing agent policies as finite-state controllers. Using this representation, we propose a new approach that formulates the problem as a nonlinear program, which defines an optimal policy of a desired size for each agent. This new formulation allows a wide range of powerful nonlinear programming algorithms to be used to solve POMDPs and DEC-POMDPs. Although solving the NLP optimally is often intractable, the results we obtain using an off-the-shelf optimization method are competitive with state-of-the-art POMDP algorithms and outperform state-of-the-art DEC-POMDP algorithms. Our approach is easy to implement and it opens up promising research directions for solving POMDPs and DEC-POMDPs using nonlinear programming methods.
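To make "an optimal policy of a desired size" concrete, the following is a minimal sketch of how a fixed-size stochastic controller could be scored on a single-agent POMDP. It covers only policy evaluation, that is, the kind of quantity a nonlinear program over controller parameters would maximize; it does not perform the optimization itself and is not the authors' implementation. The function names, array layouts (`T`, `O`, `R`, `psi`, `eta`), and the toy model are all illustrative assumptions.

```python
def evaluate_controller(T, O, R, gamma, psi, eta, tol=1e-10, max_iters=100000):
    """Iteratively evaluate a stochastic finite-state controller (assumed layout).

    T[s][a][s2]       : P(s2 | s, a)
    O[a][s2][o]       : P(o | s2, a)
    R[s][a]           : immediate reward
    psi[q][a]         : probability of taking action a in controller node q
    eta[q][a][o][q2]  : probability of moving to node q2 after action a, observation o
    Returns z, where z[q][s] is the expected discounted value of node q in state s.
    """
    n_nodes, n_states = len(psi), len(T)
    n_actions, n_obs = len(R[0]), len(O[0][0])
    z = [[0.0] * n_states for _ in range(n_nodes)]
    for _ in range(max_iters):
        new_z = [[0.0] * n_states for _ in range(n_nodes)]
        for q in range(n_nodes):
            for s in range(n_states):
                total = 0.0
                for a in range(n_actions):
                    future = 0.0
                    for s2 in range(n_states):
                        for o in range(n_obs):
                            for q2 in range(n_nodes):
                                future += (T[s][a][s2] * O[a][s2][o]
                                           * eta[q][a][o][q2] * z[q2][s2])
                    total += psi[q][a] * (R[s][a] + gamma * future)
                new_z[q][s] = total
        delta = max(abs(new_z[q][s] - z[q][s])
                    for q in range(n_nodes) for s in range(n_states))
        z = new_z
        if delta < tol:
            break
    return z


def controller_value(z, b0, q0=0):
    """Expected value of starting in node q0 under initial belief b0."""
    return sum(b0[s] * z[q0][s] for s in range(len(b0)))


# Hypothetical toy model: 2 states, 2 actions, 2 observations, 2 controller nodes.
T = [[[0.9, 0.1], [0.5, 0.5]],
     [[0.2, 0.8], [0.5, 0.5]]]
O = [[[0.8, 0.2], [0.3, 0.7]],
     [[0.5, 0.5], [0.5, 0.5]]]
# Constant reward of 1, so the exact value is 1 / (1 - gamma) for any controller.
R = [[1.0, 1.0], [1.0, 1.0]]
gamma = 0.5
psi = [[0.7, 0.3], [0.2, 0.8]]
eta = [[[[0.5, 0.5] for _ in range(2)] for _ in range(2)] for _ in range(2)]
b0 = [0.5, 0.5]

z = evaluate_controller(T, O, R, gamma, psi, eta)
value = controller_value(z, b0)
print(round(value, 6))
```

The constant-reward toy model is chosen only because its value is known in closed form, which makes the evaluation easy to check. In an optimization setting, the entries of `psi` and `eta` would be the decision variables, constrained to be valid probability distributions.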

Bibliographic details

Source
Autonomous Agents and Multi-Agent Systems, 2010, Issue 3, pp. 293-320 (28 pages)
Authors
Christopher Amato; Daniel S. Bernstein; Shlomo Zilberstein
Author affiliations

Department of Computer Science, University of Massachusetts, Amherst, MA 01003, USA (all three authors)

Indexing information
Original format: PDF
Language: English
Keywords
decision theory; multiagent systems; planning under uncertainty; POMDPs; DEC-POMDPs

Similar literature

1. On the Computational Complexity of Stochastic Controller Optimization in POMDPs [J]. Nikos Vlassis, Michael L. Littman, David Barber. ACM Transactions on Computational Theory, 2012, Issue 4.
2. On Near Optimality of the Set of Finite-State Controllers for Average Cost POMDP [J]. Huizhen Yu, Dimitri P. Bertsekas. Mathematics of Operations Research, 2008, Issue 1.
3. On near optimality of the set of finite-state controllers for average cost POMDP [J]. Yu HZ, Bertsekas DP. Mathematics of Operations Research, 2008, Issue 1.
4. Optimizing Fixed-Size Stochastic Controllers for POMDPs [C]. Christopher Amato, Daniel S. Bernstein, Shlomo Zilberstein. AAAI Workshop, 2008.
5. Optimizing Cancer Screening with POMDPs [D]. Petousis, Panayiotis. 2019.
6. Modeling and Planning with Macro-Actions in Decentralized POMDPs [O]. Christopher Amato, George Konidaris, Leslie P. Kaelbling.
7. Optimizing fixed-size stochastic controllers for POMDPs and decentralized POMDPs [O]. Christopher Amato, Daniel S. Bernstein, Shlomo Zilberstein. 2009.
