Online Planning Algorithms for POMDPs


Abstract

Partially Observable Markov Decision Processes (POMDPs) provide a rich framework for sequential decision-making under uncertainty in stochastic domains. However, solving a POMDP is often intractable except for small problems due to their complexity. Here, we focus on online approaches that alleviate the computational complexity by computing good local policies at each decision step during the execution. Online algorithms generally consist of a lookahead search to find the best action to execute at each time step in an environment. Our objectives here are to survey the various existing online POMDP methods, analyze their properties and discuss their advantages and disadvantages; and to thoroughly evaluate these online approaches in different environments under various metrics (return, error bound reduction, lower bound improvement). Our experimental results indicate that state-of-the-art online heuristic search methods can handle large POMDP domains efficiently.
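The lookahead search mentioned in the abstract can be illustrated with a minimal sketch. It is not any specific algorithm from the surveyed paper: it is a plain depth-limited expectimax over beliefs on a hypothetical two-state, tiger-style model. The model, its probabilities and rewards, the discount factor, and all function names are assumptions made for illustration, and the leaf value of 0 stands in for whatever better estimate a practical planner would use.

```python
# Hypothetical toy model: every name and number below is an illustrative assumption.
STATES = ["left", "right"]            # door hiding the hazard
ACTIONS = ["listen", "open_left", "open_right"]
OBSERVATIONS = ["hear_left", "hear_right"]
GAMMA = 0.95                          # assumed discount factor


def transition(s, a, s2):
    """P(s2 | s, a): listening keeps the state; opening a door resets it uniformly."""
    if a == "listen":
        return 1.0 if s == s2 else 0.0
    return 0.5


def observation(a, s2, o):
    """P(o | a, s2): listening is 85% accurate; other actions are uninformative."""
    if a == "listen":
        return 0.85 if o == "hear_" + s2 else 0.15
    return 0.5


def reward(s, a):
    """Immediate reward: small cost to listen, large penalty for the wrong door."""
    if a == "listen":
        return -1.0
    opened = a.split("_")[1]
    return -100.0 if opened == s else 10.0


def belief_update(b, a, o):
    """Bayes filter step: returns (posterior belief, P(o | b, a))."""
    unnorm = {}
    for s2 in STATES:
        predicted = sum(b[s] * transition(s, a, s2) for s in STATES)
        unnorm[s2] = observation(a, s2, o) * predicted
    p_o = sum(unnorm.values())
    if p_o == 0.0:
        return None, 0.0
    return {s: v / p_o for s, v in unnorm.items()}, p_o


def lookahead(b, depth):
    """Depth-limited expectimax over beliefs: returns (value, best action)."""
    if depth == 0:
        return 0.0, None              # placeholder leaf estimate (assumption)
    best_value, best_action = float("-inf"), None
    for a in ACTIONS:
        q = sum(b[s] * reward(s, a) for s in STATES)
        for o in OBSERVATIONS:
            b2, p_o = belief_update(b, a, o)
            if p_o > 0.0:
                q += GAMMA * p_o * lookahead(b2, depth - 1)[0]
        if q > best_value:
            best_value, best_action = q, a
    return best_value, best_action
```

In an online setting, an agent would call `lookahead` on its current belief at every decision step, execute the returned action, and then apply `belief_update` with the observation it actually receives. The tree grows exponentially with depth, which is why the choice of leaf estimate and of which branches to expand matters in practice.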


Bibliographic details

Journal: other
Authors: Stéphane Ross; Joelle Pineau; Sébastien Paquet; Brahim Chaib-draa
Year (volume), issue: -1 (32), 2
Year: -1
Pages: 663–704
Total pages: 51
Original format: PDF

Similar literature


1. Online Planning Algorithms for POMDPs [J]. Chaib-draa B., Paquet S., Pineau J. The Journal of Artificial Intelligence Research. 2008, No. 4
2. Online Planning Algorithms for POMDPs [J]. S. Ross, J. Pineau, S. Paquet. Journal of Automation, Mobile Robotics & Intelligent Systems. 2008, No. 1
3. Online Planning Algorithms for POMDPs [J]. Stephane Ross, Joelle Pineau, Sebastien Paquet. The Journal of Artificial Intelligence Research. 2008, No. 0
4. An improved Monte Carlo POMDPs online planning algorithm combined with RAVE heuristic [C]. Peigen Liu, Jing Chen, Hongfu Liu. IEEE International Conference on Software Engineering and Service Science. 2015
5. A Bayesian Framework for Online Parameter Learning in POMDPs [D]. Atrash, Amin. 2011
6. Theoretical Analysis of Heuristic Search Methods for Online POMDPs [O]. Stéphane Ross, Joelle Pineau, Brahim Chaib-draa. -1
7. Online Planning Algorithms for POMDPs [O]. Ross, Stéphane; Pineau, Joelle; Paquet, Sébastien. 2014
