A point-reduced POMDP value iteration algorithm with application to robot navigation

Abstract

Exact value iteration for POMDP planning is so computationally complex that approximation is used to solve problems in practice. In recent years, point-based algorithms have become a research hotspot. The PBVI algorithm selects successor belief points that improve the worst-case density of the belief set as rapidly as possible. The smaller the gaps between belief points, the faster the value function converges to the optimal solution. PBVI doubles the size of the belief set after each expansion, and this exponential growth makes the algorithm incapable of solving problems with long horizons. However, some points in the set contribute little to the density. These points can be removed to reduce the size of the set, so that fewer points are expanded and more backups can be executed during each iteration. Based on this observation, this paper introduces a point-reduced POMDP value iteration algorithm (PRVI) and applies it to robot navigation problems. PRVI improves on the original PBVI and is reported to outperform other POMDP algorithms. Experiments show that PRVI significantly improves efficiency.
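The expand-then-reduce idea described in the abstract can be illustrated with a minimal sketch. The abstract does not state the paper's actual reduction criterion, so the rule below (drop a belief that lies within an L1 distance `eps` of a belief already kept) is an assumption made purely for illustration, as are the names `belief_update`, `expand`, `reduce_points`, and the array layouts `T[a, s, s']` and `Z[a, s', o]`. The expansion step also enumerates every action and observation pair instead of sampling successors by simulation, which is a simplification of PBVI, not a reproduction of it.

```python
import numpy as np


def belief_update(b, a, o, T, Z):
    """Bayes filter: b'(s') is proportional to Z[a, s', o] * sum_s T[a, s, s'] * b(s)."""
    unnorm = Z[a, :, o] * (b @ T[a])
    total = unnorm.sum()
    # An observation with zero probability yields no valid successor belief.
    return unnorm / total if total > 0 else None


def expand(beliefs, T, Z):
    """PBVI-style expansion: each belief adds at most one successor,
    the candidate farthest (in L1 distance) from the current set.
    Simplification: all (action, observation) pairs are enumerated."""
    n_actions, n_obs = T.shape[0], Z.shape[2]
    new_points = list(beliefs)
    for b in beliefs:
        best, best_dist = None, 0.0
        for a in range(n_actions):
            for o in range(n_obs):
                cand = belief_update(b, a, o, T, Z)
                if cand is None:
                    continue
                dist = min(np.abs(cand - p).sum() for p in new_points)
                if dist > best_dist:
                    best, best_dist = cand, dist
        if best is not None:
            new_points.append(best)
    return new_points


def reduce_points(beliefs, eps):
    """Hypothetical reduction rule (not taken from the paper): keep a belief
    only if it is more than eps (L1) away from every belief already kept."""
    kept = []
    for b in beliefs:
        if all(np.abs(b - k).sum() > eps for k in kept):
            kept.append(b)
    return kept
```

Because `expand` adds at most one point per existing belief, the set can at most double per expansion, which matches the growth the abstract attributes to PBVI. Calling `reduce_points` after each expansion is one way to keep that growth in check, at the cost of choosing `eps` by hand.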

Bibliographic record

Source
World Congress on Intelligent Control and Automation | 2014 | 6 pages
Authors
Bo Du; Feng Liu; Zhihong Zhao
Original format
PDF
Chinese Library Classification
Artificial intelligence theory
Keywords
Partially observable Markov decision processes; Point-based algorithms; Robot navigation; RockSample problem

