首页> 外文会议>Mexican International Conference on Artificial Intelligence >POMDP Filter: Pruning POMDP Value Functions with the Kaczmarz Iterative Method
【24h】

POMDP Filter: Pruning POMDP Value Functions with the Kaczmarz Iterative Method

机译:POMDP过滤器:用Kaczmarz迭代方法修剪POMDP值函数

获取原文

摘要

In recent years, there has been significant interest in developing techniques for finding policies for Partially Observable Markov Decision Problems (POMDPs). This paper introduces a new POMDP filtering technique that is based on Incremental Pruning [1], but relies on geometries of hyperplane arrangements to compute for optimal policy. This new approach applies notions of linear algebra to transform hyperplanes and treat their intersections as witness points [5]. The main idea behind this technique is that a vector that has the highest value at any of the intersection points must be part of the policy. IPBS is an alternative of using linear programming (LP), which requires powerful and expensive libraries, and which is subjected to numerical instability.
机译:近年来,对开发用于寻找部分可观察的马尔可夫决策问题(POMDPS)的政策的技能产生了重大兴趣。本文介绍了一种新的POMDP滤波技术,基于增量修剪[1],但依赖于超平面安排的几何形状计算,以计算最佳政策。这种新方法适用线性代数的概念来转换超平面并将其交叉点视为见证点[5]。这种技术背后的主要思想是,在任何交叉点处具有最高值的矢量必须是策略的一部分。 IPB是使用线性编程(LP)的替代方案,这需要强大且昂贵的库,并且受到数值不稳定性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号