Proceedings of the 3rd European Conference on Mobile Robots

Improving reinforcement learning through a better exploration strategy and an adjustable representation of the environment



Abstract

Reinforcement learning is a promising strategy because all the robot needs in order to start a random search for the desired solution is a reinforcement function specifying the main restrictions on the behaviour. Nevertheless, the robot wastes too much time executing random, mostly wrong, actions, and the user is forced to determine the balance between exploring new actions and exploiting already tried ones. In this context we propose a methodology that achieves fast convergence towards good robot-control policies and determines on its own the required degree of exploration at every instant. The performance of our approach is due to the mutual, dynamic influence that three elements exert on each other: reinforcement learning, genetic algorithms, and a dynamic representation of the environment around the robot. In this paper we describe the application of our approach to two common tasks in mobile robotics: wall following and door traversal. The experimental results show that the required learning time is significantly reduced and the stability of the learning process is increased. Furthermore, the low user intervention required to solve both tasks (only the reinforcement function is changed) confirms the contribution of this approach towards robot-learning techniques that are fast, user friendly, and demand little application-specific knowledge from the user, something increasingly required nowadays.
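The exploration/exploitation balance the abstract refers to can be illustrated with a minimal, hypothetical sketch (not the paper's method, which adapts the exploration degree automatically): tabular Q-learning on a toy one-dimensional corridor, where a hand-tuned exploration rate decays over the episodes.

```python
import random

def q_learning(n_states=6, episodes=200, alpha=0.5, gamma=0.9, seed=0):
    """Tabular Q-learning on a 1-D corridor: start at state 0, goal at the
    right end; actions are 0 = left, 1 = right; reward 1 only at the goal."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(n_states)]
    for ep in range(episodes):
        eps = max(0.05, 0.98 ** ep)  # exploration rate decays over time
        s = 0
        while s != n_states - 1:
            if rng.random() < eps:
                a = rng.randrange(2)               # explore: random action
            else:
                a = 0 if q[s][0] > q[s][1] else 1  # exploit: greedy action
            s2 = max(0, s - 1) if a == 0 else s + 1
            r = 1.0 if s2 == n_states - 1 else 0.0
            q[s][a] += alpha * (r + gamma * max(q[s2]) - q[s][a])
            s = s2
    return q

q = q_learning()
policy = [0 if row[0] > row[1] else 1 for row in q[:-1]]  # greedy policy
```

The decay schedule in this sketch is the user-tuned knob the paper argues against: too much early exploitation and the robot never finds the goal, too much late exploration and it keeps wasting steps on wrong actions.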


