A point-reduced POMDP value iteration algorithm with application to robot navigation

机译：点降低POMDP值迭代算法及其在机器人导航中的应用

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The exact value iteration for POMDP planning is so complex that we use approximation to solve the problems in practice. In recent years, point-based algorithm has become a research hotspot. PBVI algorithm selects successors that improve the worst case density as rapidly as possible. The smaller the gaps between all belief points, the faster the value function converges to the optimal solutions. PBVI doubles the size of the belief set after each expansion. The exponential increase makes this algorithm incapable to solve problems with long horizons. However, there are some points in the set which have little contribution to the density. These points can be reduced to decrease the size of the set. Meanwhile, fewer points are expanded and more backups can be executed during each iteration. Based on this, this paper introduces a point-reduced POMDP value iteration algorithm and applied it to robot navigation problems. PRVI improves the original PBVI and is superior to other POMDP algorithms. Experiments supported that PRVI significantly improved the efficiency.

机译：POMDP规划的精确值迭代是如此复杂，以至于我们使用近似值来解决实际问题。近年来，基于点的算法已成为研究热点。 PBVI算法选择后继者，以尽快提高最坏情况下的密度。所有置信点之间的间隙越小，价值函数收敛到最优解的速度就越快。每次扩展后，PBVI会将信念集的大小加倍。指数级增长使得该算法无法解决长远问题。但是，集合中的某些点对密度的贡献很小。可以减少这些点以减小集合的大小。同时，扩展更少的点，并且在每次迭代期间可以执行更多的备份。在此基础上，本文提出了一种点减少的POMDP值迭代算法，并将其应用于机器人导航问题。 PRVI改进了原始的PBVI，并且优于其他POMDP算法。实验支持PRVI大大提高了效率。

著录项

来源
《World Congress on Intelligent Control and Automation》|2014年|1427-1432|共6页
会议地点
作者
Bo Du; Feng Liu; Zhihong Zhao;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Partially observable Markov decision processes; Point-based Algorithms; Robot Navigation; RockSample Problem;

机译：部分可观察的Markov决策过程;基于点的算法;机器人导航; RockSample问题;

相似文献

外文文献
中文文献
专利

1. Decision-Theoretical Navigation of Service Robots Using POMDPs with Human-Robot Co-Occurrence Prediction [J] . Kun Qian, Xudong Ma, Xianzhong Dai, International Journal of Advanced Robotic Systems . 2017,第2期

机译：使用具有人机共生预测的POMDP的服务机器人决策理论导航
2. Decision-Theoretical Navigation of Service Robots Using POMDPs with Human-Robot Co-Occurrence Prediction [J] . Qian Kun, Ma Xudong, Dai Xianzhong, International Journal of Advanced Robotic Systems . 2013,第期

机译：使用POMDP与人机协同预测的服务机器人决策理论导航
3. A Navigation System for Assistant Robots Using Visually Augmented POMDPs [J] . MARIA ELENA LOPEZ, LUIS MIGUEL BERGASA, RAFAEL BAREA, Autonomous robots . 2005,第1期

机译：使用视觉增强的POMDP的辅助机器人导航系统
4. A point-reduced POMDP value iteration algorithm with application to robot navigation [C] . Bo Du, Feng Liu, Zhihong Zhao World Congress on Intelligent Control and Automation . 2014

机译：一种点减少的POMDP值迭代算法应用于机器人导航
5. Comparative Performance Analysis of Navigation Algorithm and Deep Learning Application: Different Infrastructure and Cloud Robotics [D] . Bhaskaran, Divya. 2017

机译：导航算法与深度学习应用的比较性能分析：不同的基础架构和云机器人
6. Adaptive Iterated Extended Kalman Filter and Its Application to Autonomous Integrated Navigation for Indoor Robot [O] . Yuan Xu, Xiyuan Chen, Qinghua Li -1

机译：自适应迭代扩展卡尔曼滤波及其在室内机器人自主组合导航中的应用
7. Bayesian Reinforcement Learning in Continuous POMDPs with Application to Robot Navigation [O] . Stephane Ross et al. 2010

机译：连续pOmDp中的贝叶斯强化学习及其在机器人导航中的应用

A point-reduced POMDP value iteration algorithm with application to robot navigation

摘要

著录项

相似文献

相关主题

期刊订阅