首页> 外文会议>World Congress on Intelligent Control and Automation >A point-reduced POMDP value iteration algorithm with application to robot navigation
【24h】

A point-reduced POMDP value iteration algorithm with application to robot navigation

机译:点降低POMDP值迭代算法及其在机器人导航中的应用

获取原文

摘要

The exact value iteration for POMDP planning is so complex that we use approximation to solve the problems in practice. In recent years, point-based algorithm has become a research hotspot. PBVI algorithm selects successors that improve the worst case density as rapidly as possible. The smaller the gaps between all belief points, the faster the value function converges to the optimal solutions. PBVI doubles the size of the belief set after each expansion. The exponential increase makes this algorithm incapable to solve problems with long horizons. However, there are some points in the set which have little contribution to the density. These points can be reduced to decrease the size of the set. Meanwhile, fewer points are expanded and more backups can be executed during each iteration. Based on this, this paper introduces a point-reduced POMDP value iteration algorithm and applied it to robot navigation problems. PRVI improves the original PBVI and is superior to other POMDP algorithms. Experiments supported that PRVI significantly improved the efficiency.
机译:POMDP规划的精确值迭代是如此复杂,以至于我们使用近似值来解决实际问题。近年来,基于点的算法已成为研究热点。 PBVI算法选择后继者,以尽快提高最坏情况下的密度。所有置信点之间的间隙越小,价值函数收敛到最优解的速度就越快。每次扩展后,PBVI会将信念集的大小加倍。指数级增长使得该算法无法解决长远问题。但是,集合中的某些点对密度的贡献很小。可以减少这些点以减小集合的大小。同时,扩展更少的点,并且在每次迭代期间可以执行更多的备份。在此基础上,本文提出了一种点减少的POMDP值迭代算法,并将其应用于机器人导航问题。 PRVI改进了原始的PBVI,并且优于其他POMDP算法。实验支持PRVI大大提高了效率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号