Is it possible to plan at a coarse level while acting at a fine level with a neural-network (NN) reinforcement-learning (RL) planner? This work presents an NN planner that plans at an abstract level and is used to control a simulated robot in a stochastic landmark-navigation problem. The controller has both reactive components, based on actor-critic RL, and planning components inspired by the Dyna-PI architecture (roughly, RL plus a model of the environment). Coarse planning is based on macro-actions, each defined as a sequence of identical primitive actions: the planner updates the evaluations and the action policy while generating simulated experience at the macro level with a model of the environment (an NN trained at the macro level). The simulations show how the controller works, demonstrate the advantages of using a discount coefficient tuned to the level of planning coarseness, and suggest that discounted RL has problems dealing with long periods of time.
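The core ideas in the abstract — macro-actions built from K identical primitive actions, Dyna-style planning with a learned macro-level model, and a discount matched to planning coarseness (gamma raised to the power K, since each macro transition spans K primitive steps) — can be illustrated with a minimal sketch. Everything below is a hypothetical toy: a tabular Dyna-Q update on a 1-D corridor, not the paper's NN actor-critic or its landmark-navigation task; the corridor size, K=3, and all learning parameters are assumptions for illustration only.

```python
import random

random.seed(0)

# -- toy setup (hypothetical; the paper uses an NN actor-critic on a
#    simulated landmark-navigation task, not this tabular corridor) --
N_STATES = 10              # corridor cells 0..9, goal at the right end
GOAL = N_STATES - 1
ACTIONS = (-1, +1)         # primitive actions: one step left / right
K = 3                      # macro-action = K identical primitive actions
GAMMA = 0.95               # per-primitive-step discount
GAMMA_MACRO = GAMMA ** K   # discount tuned to the planning coarseness
ALPHA = 0.5                # learning rate
N_PLAN = 10                # simulated macro-level backups per real step

def step_macro(s, a):
    """Execute one primitive action K times; reward 1 on reaching the goal."""
    for _ in range(K):
        s = max(0, min(GOAL, s + a))
        if s == GOAL:
            return s, 1.0
    return s, 0.0

Q = {(s, a): 0.0 for s in range(N_STATES) for a in ACTIONS}
model = {}                 # learned macro-level model: (s, a) -> (s', r)

s = 0
for _ in range(300):
    a = random.choice(ACTIONS)        # random exploration, for simplicity
    s2, r = step_macro(s, a)
    model[(s, a)] = (s2, r)           # macro-level model learning
    # One real backup plus N_PLAN simulated (Dyna-style) backups, all
    # discounted with GAMMA_MACRO because each transition spans K steps.
    backups = [((s, a), (s2, r))] + random.choices(list(model.items()), k=N_PLAN)
    for (bs, ba), (bs2, br) in backups:
        tgt = br + (0.0 if bs2 == GOAL else
                    GAMMA_MACRO * max(Q[(bs2, b)] for b in ACTIONS))
        Q[(bs, ba)] += ALPHA * (tgt - Q[(bs, ba)])
    s = 0 if s2 == GOAL else s2

# Greedy macro-policy over the states the agent actually visits (0, 3, 6):
# all three should point right (+1), toward the goal.
greedy = {st: max(ACTIONS, key=lambda a: Q[(st, a)]) for st in (0, 3, 6)}
print(greedy)
```

The point of `GAMMA_MACRO` is that a macro backup must account for the real time a macro-action consumes: discounting each macro transition by gamma to the K keeps the learned values consistent with the primitive-level task, which is the coarseness-tuning advantage the abstract reports.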