Generalized Controllers in POMDP Decision-Making

机译：POMDP决策中的通用控制器

获取原文

获取外文期刊封面目录资料

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

We present a general policy formulation for partially observable Markov decision processes (POMDPs) called controller family policies that may be used as a framework to facilitate the design of new policy forms. We prove how modern approximate policy forms: point-based, finite state controller (FSC), and belief compression, are instances of this family of generalized controller policies. Our analysis provides a deeper understanding of the POMDP model and suggests novel ways to design POMDP solutions that can combine the benefits of different state-of-the-art methods. We illustrate this capability by creating a new customized POMDP policy form called the belief-integrated FSC (BI-FSC) tailored to overcome the shortcomings of a state-of-the-art algorithm that uses non-linear programming (NLP). Specifically, experiments show that for NLP the BI-FSC offers improved performance over a vanilla FSC-based policy form on benchmark domains. Furthermore, we demonstrate the BI-FSC's execution on a real robot navigating in a maze environment. Results confirm the value of using the controller family policy as a framework to design customized policies in POMDP robotic solutions.

机译：我们为部分可观察到的马尔可夫决策过程（POMDP）提供了一种通用的政策制定方法，称为控制者家庭政策，该政策可以用作促进设计新政策形式的框架。我们证明了现代的近似策略形式：基于点的有限状态控制器（FSC）和置信度压缩，是该系列广义控制器策略的实例。我们的分析提供了对POMDP模型的更深入的理解，并提出了设计POMDP解决方案的新颖方法，这些方法可以结合各种最新方法的优点。我们通过创建一种称为信念集成FSC（BI-FSC）的新定制POMDP策略表单来说明这种功能，该表单旨在克服使用非线性编程（NLP）的最新算法的缺点。具体而言，实验表明，对于NLP，BI-FSC的性能优于基准域上基于FSC的原始策略形式。此外，我们演示了BI-FSC在迷宫环境中导航的真实机器人上的执行情况。结果证实了使用控制器系列策略作为框架来设计POMDP机器人解决方案中的自定义策略的价值。

著录项

来源
《International Conference on Robotics and Automation》|2019年|7166-7172|共7页
会议地点 Montreal(CA)
作者
Kyle Hollins Wray; Shlomo Zilberstein;
展开▼
作者单位

University of Massachusetts Amherst MA 01002 USA;

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Power capacitors; Mathematical model; Robots; Approximation algorithms; Markov processes; Process control; Navigation;

机译：功率电容器；数学模型;机器人；近似算法；马尔可夫过程；过程控制;导航;

相似文献

外文文献
中文文献
专利

1. Optimizing fixed-size stochastic controllers for POMDPs and decentralized POMDPs [J] . Christopher Amato, Daniel S. Bernstein, Shlomo Zilberstein Autonomous agents and multi-agent systems . 2010,第3期

机译：针对POMDP和分散式POMDP优化固定大小的随机控制器
2. αPOMDP: POMDP-based user-adaptive decision-making for social robots [J] . Martins Goncalo S., Al Tair Hend, Santos Luis, Pattern recognition letters . 2019,第FEBa期

机译：αPOMDP：基于POMDP的社交机器人用户自适应决策
3. On Near Optimality of the Set of Finite-State Controllers for Average Cost POMDP [J] . Huizhen Yu Dimitri P. Bertsekas Mathematics of Operations Research . 2008,第1期

机译：平均成本POMDP的有限状态控制器集的近似最优性
4. Generalized Controllers in POMDP Decision-Making [C] . Kyle Hollins Wray, Shlomo Zilberstein International Conference on Robotics and Automation . 2019

机译：POMDP决策中的广义控制器
5. Controller certification: The generalized stability margin inference for a large number of MIMO controllers. [D] . Park, Jisang. 2008

机译：控制器认证：针对大量MIMO控制器的广义稳定性裕度推断。
6. A Low T Regulatory Cell Response May Contribute to Both Viral Control and Generalized Immune Activation in HIV Controllers [O] . Peter W. Hunt, Alan L. Landay, Elizabeth Sinclair, 2009

机译：低T调节细胞反应可能有助于HIV控制者中的病毒控制和广泛的免疫激活。
7. Optimizing fixed-size stochastic controllers for POMDPs and decentralized POMDPs [O] . Christopher Amato, Daniel S. Bernstein, Shlomo Zilberstein 2009

机译：针对POMDP和分散式POMDP优化固定大小的随机控制器

Generalized Controllers in POMDP Decision-Making

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅