首页> 外文会议>MICAI 2010;Mexican international conference on artificial intelligence >POMDP Filter: Pruning POMDP Value Functions with the Kaczmarz Iterative Method
【24h】

POMDP Filter: Pruning POMDP Value Functions with the Kaczmarz Iterative Method

机译:POMDP过滤器:使用Kaczmarz迭代方法修剪POMDP值函数

获取原文
获取外文期刊封面目录资料

摘要

In recent years, there has been significant interest in developing techniques for finding policies for Partially Observable Markov Decision Problems (POMDPs). This paper introduces a new POMDP filtering technique that is based on Incremental Pruning [1], but relies on geometries of hyperplane arrangements to compute for optimal policy. This new approach applies notions of linear algebra to transform hyperplanes and treat their intersections as witness points [5]. The main idea behind this technique is that a vector that has the highest value at any of the intersection points must be part of the policy. IPBS is an alternative of using linear programming (LP), which requires powerful and expensive libraries, and which is subjected to numerical instability.
机译:近年来,人们非常关注开发用于找到部分可观察的马尔可夫决策问题(POMDP)策略的技术。本文介绍了一种新的POMDP过滤技术,该技术基于增量修剪[1],但它依赖于超平面布置的几何形状来计算最佳策略。这种新方法运用线性代数的概念来变换超平面并将其相交视为见证点[5]。该技术背后的主要思想是,在任何交点处具有最高值的向量必须是策略的一部分。 IPBS是使用线性编程(LP)的替代方法,该方法需要功能强大且昂贵的库,并且存在数值不稳定的问题。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号