Anytime Point-Based Approximations for Large POMDPs

J. Pineau; G. Gordon; S. Thrun

首页> 外文期刊>Journal of Automation, Mobile Robotics & Intelligent Systems >Anytime Point-Based Approximations for Large POMDPs

【24h】

Anytime Point-Based Approximations for Large POMDPs

机译：大型POMDP的随时基于点的近似

获取原文

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

The Partially Observable Markov Decision Process has long been recognized as a rich framework for real-world planning and control problems, especially in robotics. However exact solutions in this framework are typically computationally intractable for all but the smallest problems. A well-known technique for speeding up POMDP solving involves performing value backups at specific belief points, rather than over the entire belief simplex. The efficiency of this approach, however, depends greatly on the selection of points. This paper presents a set of novel techniques for selecting informative belief points which work well in practice. The point selection procedure is combined with point-based value backups to form an effective anytime POMDP algorithm called Point-Based Value Iteration (PBVI). The first aim of this paper is to introduce this algorithm and present a theoretical analysis justifying the choice of belief selection technique. The second aim of this paper is to provide a thorough empirical comparison between PBVI and other state-of-the-art POMDP methods, in particular the Perseus algorithm, in an effort to highlight their similarities and differences. Evaluation is performed using both standard POMDP domains and realistic robotic tasks.

机译：长期以来，部分可观察的马尔可夫决策过程一直被认为是解决现实世界中计划和控制问题（尤其是机器人技术）的丰富框架。但是，除了最小的问题外，此框架中的精确解决方案通常在计算上难以解决。加快POMDP解决速度的一项众所周知的技术涉及在特定置信点而不是在整个置信单纯点上执行值备份。但是，这种方法的效率在很大程度上取决于点的选择。本文提出了一套新颖的技术，用于选择在实践中效果很好的信息性信念点。点选择过程与基于点的值备份相结合，可以形成一种有效的随时可用的POMDP算法，称为基于点的值迭代（PBVI）。本文的首要目的是介绍该算法，并提供理论分析，证明选择信念选择技术的合理性。本文的第二个目的是在PBVI和其他最新的POMDP方法（尤其是Perseus算法）之间进行透彻的经验比较，以强调它们的异同。使用标准POMDP域和实际的机器人任务来执行评估。

著录项

来源
《Journal of Automation, Mobile Robotics & Intelligent Systems》 |2006年第5期|共46页
作者
J. Pineau; G. Gordon; S. Thrun;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类人工智能理论;
关键词

相似文献

外文文献
中文文献
专利

1. Anytime Point-Based Approximations for Large POMDPs [J] . Gordon G., Pineau J., Thrun S. The Journal of Artificial Intelligence Research . 2006,第5期

机译：大型POMDP的随时基于点的近似
2. Point-Based Value Iteration for Finite-Horizon POMDPs [J] . Walraven Erwin, Spaan Matthijs T. J. The Journal of Artificial Intelligence Research . 2019,第期

机译：有限地平线POMDPS的基于点的值迭代
3. A randomized point-based value iteration POMDP enhanced with a counting process technique for optimal management of multi-state multi-element systems [J] . Fereshtehnejad Ehsan, Shafieezadeh Abdollah Structural Safety . 2017,第期

机译：基于计数点技术的基于随机点的值迭代POMDP增强了多状态多元素系统的最优管理
4. Point-Based POMDP Solving with Factored Value Function Approximation [C] . Tiago S. Veiga, Matthijs T. J. Spaan, Pedro U. Lima AAAI Conference on Artificial Intelligence . 2014

机译：基于点的POMDP解决了因子价值函数近似
5. Point-Based POMDP Solvers: Survey and Comparative Analysis. [D] . Kaplow, Robert. 2010

机译：基于点的POMDP解决方案：调查和比较分析。
6. Gene regulatory network inference by point-based Gaussian approximation filtersincorporating the prior information [O] . Bin Jia, Xiaodong Wang 2013

机译：基于点的高斯近似滤波器的基因调控网络推断合并先前的信息
7. Anytime Point-Based Approximations for Large POMDPs [O] . Pineau, J., Gordon, G., Thrun, S. 2011

机译：任何时候大型pOmDp的基于点的近似

Anytime Point-Based Approximations for Large POMDPs

摘要

著录项

相似文献

相关主题

期刊订阅