首页> 外文期刊>The Journal of Artificial Intelligence Research >Perseus: Randomized Point-based Value Iteration for POMDPs
【24h】

Perseus: Randomized Point-based Value Iteration for POMDPs

机译:英仙座:POMDP的基于点的随机值迭代

获取原文
获取原文并翻译 | 示例
           

摘要

Partially observable Markov decision processes (POMDPs) form an attractive and principled framework for agent planning under uncertainty. Point-based approximate techniques for POMDPs compute a policy based on a finite set of points collected in advance from the agent's belief space. We present a randomized point-based value iteration algorithm called Perseus. The algorithm performs approximate value backup stages, ensuring that in each backup stage the value of each point in the belief set is improved; the key observation is that a single backup may improve the value of many belief points. Contrary to other point-based methods, Perseus backs up only a (randomly selected) subset of points in the belief set, sufficient for improving the value of each belief point in the set. We show how the same idea can be extended to dealing with continuous action spaces. Experimental results show the potential of Perseus in large scale POMDP problems.
机译:部分可观察的马尔可夫决策过程(POMDP)构成了不确定情况下代理商计划的有吸引力且原则性的框架。用于POMDP的基于点的近似技术可根据预先从代理的信念空间收集的有限点集来计算策略。我们提出了一种称为Perseus的基于随机点的值迭代算法。该算法执行近似值备份阶段,以确保在每个备份阶段中置信集中的每个点的值都得到改善;关键观察结果是,单个备份可以提高许多置信点的价值。与其他基于点的方法相反,Perseus仅备份置信集中的点的(子集)(随机选择),足以改善集中的每个置信点的值。我们展示了如何将相同的想法扩展到处理连续的动作空间。实验结果表明,Perseus在大规模POMDP问题中的潜力。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号