首页> 外文会议>International Conference on Robotics and Automation >Generalized Controllers in POMDP Decision-Making
【24h】

Generalized Controllers in POMDP Decision-Making

机译:POMDP决策中的通用控制器

获取原文
获取外文期刊封面目录资料

摘要

We present a general policy formulation for partially observable Markov decision processes (POMDPs) called controller family policies that may be used as a framework to facilitate the design of new policy forms. We prove how modern approximate policy forms: point-based, finite state controller (FSC), and belief compression, are instances of this family of generalized controller policies. Our analysis provides a deeper understanding of the POMDP model and suggests novel ways to design POMDP solutions that can combine the benefits of different state-of-the-art methods. We illustrate this capability by creating a new customized POMDP policy form called the belief-integrated FSC (BI-FSC) tailored to overcome the shortcomings of a state-of-the-art algorithm that uses non-linear programming (NLP). Specifically, experiments show that for NLP the BI-FSC offers improved performance over a vanilla FSC-based policy form on benchmark domains. Furthermore, we demonstrate the BI-FSC's execution on a real robot navigating in a maze environment. Results confirm the value of using the controller family policy as a framework to design customized policies in POMDP robotic solutions.
机译:我们为部分可观察到的马尔可夫决策过程(POMDP)提供了一种通用的政策制定方法,称为控制者家庭政策,该政策可以用作促进设计新政策形式的框架。我们证明了现代的近似策略形式:基于点的有限状态控制器(FSC)和置信度压缩,是该系列广义控制器策略的实例。我们的分析提供了对POMDP模型的更深入的理解,并提出了设计POMDP解决方案的新颖方法,这些方法可以结合各种最新方法的优点。我们通过创建一种称为信念集成FSC(BI-FSC)的新定制POMDP策略表单来说明这种功能,该表单旨在克服使用非线性编程(NLP)的最新算法的缺点。具体而言,实验表明,对于NLP,BI-FSC的性能优于基准域上基于FSC的原始策略形式。此外,我们演示了BI-FSC在迷宫环境中导航的真实机器人上的执行情况。结果证实了使用控制器系列策略作为框架来设计POMDP机器人解决方案中的自定义策略的价值。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号