首页> 外文会议>World Congress on Intelligent Control and Automation >A point-reduced POMDP value iteration algorithm with application to robot navigation
【24h】

A point-reduced POMDP value iteration algorithm with application to robot navigation

机译:一种点减少的POMDP值迭代算法应用于机器人导航

获取原文

摘要

The exact value iteration for POMDP planning is so complex that we use approximation to solve the problems in practice. In recent years, point-based algorithm has become a research hotspot. PBVI algorithm selects successors that improve the worst case density as rapidly as possible. The smaller the gaps between all belief points, the faster the value function converges to the optimal solutions. PBVI doubles the size of the belief set after each expansion. The exponential increase makes this algorithm incapable to solve problems with long horizons. However, there are some points in the set which have little contribution to the density. These points can be reduced to decrease the size of the set. Meanwhile, fewer points are expanded and more backups can be executed during each iteration. Based on this, this paper introduces a point-reduced POMDP value iteration algorithm and applied it to robot navigation problems. PRVI improves the original PBVI and is superior to other POMDP algorithms. Experiments supported that PRVI significantly improved the efficiency.
机译:POMDP规划的确切值迭代是如此复杂,我们使用近似来解决实践中的问题。近年来,基于点的算法已成为一个研究热点。 PBVI算法选择后续的继承人,可以尽可能快地提高最坏情况密度。所有信仰点之间的间隙越小,值函数越快收敛到最佳解决方案。 PBVI每次扩展后都会使信仰的大小翻倍。指数增加使得该算法无法解决长视野的问题。然而,该集合中有一些点对密度几乎没有贡献。可以减少这些点以降低集合的大小。同时,扩展的点较少,并且在每次迭代期间可以执行更多备份。基于此,本文介绍了一个减少的POMDP值迭代算法,并将其应用于机器人导航问题。 PRVI改善了原来的PBVI,优于其他POMDP算法。实验支持PRVI显着提高了效率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号