...
首页> 外文期刊>Journal of Automation, Mobile Robotics & Intelligent Systems >Anytime Point-Based Approximations for Large POMDPs
【24h】

Anytime Point-Based Approximations for Large POMDPs

机译:大型POMDP的随时基于点的近似

获取原文
   

获取外文期刊封面封底 >>

       

摘要

The Partially Observable Markov Decision Process has long been recognized as a rich framework for real-world planning and control problems, especially in robotics. However exact solutions in this framework are typically computationally intractable for all but the smallest problems. A well-known technique for speeding up POMDP solving involves performing value backups at specific belief points, rather than over the entire belief simplex. The efficiency of this approach, however, depends greatly on the selection of points. This paper presents a set of novel techniques for selecting informative belief points which work well in practice. The point selection procedure is combined with point-based value backups to form an effective anytime POMDP algorithm called Point-Based Value Iteration (PBVI). The first aim of this paper is to introduce this algorithm and present a theoretical analysis justifying the choice of belief selection technique. The second aim of this paper is to provide a thorough empirical comparison between PBVI and other state-of-the-art POMDP methods, in particular the Perseus algorithm, in an effort to highlight their similarities and differences. Evaluation is performed using both standard POMDP domains and realistic robotic tasks.
机译:长期以来,部分可观察的马尔可夫决策过程一直被认为是解决现实世界中计划和控制问题(尤其是机器人技术)的丰富框架。但是,除了最小的问题外,此框架中的精确解决方案通常在计算上难以解决。加快POMDP解决速度的一项众所周知的技术涉及在特定置信点而不是在整个置信单纯点上执行值备份。但是,这种方法的效率在很大程度上取决于点的选择。本文提出了一套新颖的技术,用于选择在实践中效果很好的信息性信念点。点选择过程与基于点的值备份相结合,可以形成一种有效的随时可用的POMDP算法,称为基于点的值迭代(PBVI)。本文的首要目的是介绍该算法,并提供理论分析,证明选择信念选择技术的合理性。本文的第二个目的是在PBVI和其他最新的POMDP方法(尤其是Perseus算法)之间进行透彻的经验比较,以强调它们的异同。使用标准POMDP域和实际的机器人任务来执行评估。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号