...
首页> 外文期刊>International Journal of Intelligence Science >Evaluating Effects of Two Alternative Filters for the Incremental Pruning Algorithm on Quality of Pomdp Exact Solutions
【24h】

Evaluating Effects of Two Alternative Filters for the Incremental Pruning Algorithm on Quality of Pomdp Exact Solutions

机译:增量修剪算法的两个替代过滤器对Pomdp精确解质量的影响评估

获取原文
   

获取外文期刊封面封底 >>

       

摘要

Decision making is one of the central problems in artificial intelligence and specifically in robotics. In most cases this problem comes with uncertainty both in data received by the decision maker/agent and in the actions performed in the environment. One effective method to solve this problem is to model the environment and the agent as a Partially Observable Markov Decision Process (POMDP). A POMDP has a wide range of applications such as: Machine Vision, Marketing, Network troubleshooting, Medical diagnosis etc. In recent years, there has been a significant interest in developing techniques for finding policies for (POMDPs).We consider two new techniques, called Recursive Point Filter (RPF) and Scan Line Filter (SCF) based on Incremental Pruning (IP) POMDP solver to introduce an alternative method to Linear Programming (LP) filter for IP. Both, RPF and SCF have solutions for several POMDP problems that LP could not converge to in 24 hours. Experiments are run on problems from POMDP literature, and an Average Discounted Reward (ADR) is computed by testing the policy in a simulated environment.
机译:决策是人工智能特别是机器人技术中的核心问题之一。在大多数情况下,决策者/代理接收到的数据以及在环境中执行的操作都存在不确定性。解决此问题的一种有效方法是将环境和主体建模为部分可观察的马尔可夫决策过程(POMDP)。 POMDP具有广泛的应用,例如:机器视觉,市场营销,网络故障排除,医学诊断等。近年来,人们对开发用于查找(POMDP)策略的技术产生了浓厚的兴趣。我们考虑了两种新技术,称为递归点过滤器(RPF)和基于增量修剪(IP)POMDP求解器的扫描线过滤器(SCF),为IP线性编程(LP)过滤器引入了另一种方法。 RPF和SCF都为LP无法在24小时内收敛的几个POMDP问题提供了解决方案。针对POMDP文献中的问题进行了实验,并通过在模拟环境中测试策略来计算平均折扣奖励(ADR)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号