Experience Replay for Real-Time Reinforcement Learning Control

Adam S.; Busoniu L.; Babuska R.

首页> 外文期刊>Systems, Man, and Cybernetics, Part C: Applications and Reviews, IEEE Transactions on >Experience Replay for Real-Time Reinforcement Learning Control

【24h】

Experience Replay for Real-Time Reinforcement Learning Control

机译：体验回放，用于实时强化学习控制

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Reinforcement-learning (RL) algorithms can automatically learn optimal control strategies for nonlinear, possibly stochastic systems. A promising approach for RL control is experience replay (ER), which learns quickly from a limited amount of data, by repeatedly presenting these data to an underlying RL algorithm. Despite its benefits, ER RL has been studied only sporadically in the literature, and its applications have largely been confined to simulated systems. Therefore, in this paper, we evaluate ER RL on real-time control experiments that involve a pendulum swing-up problem and the vision-based control of a goalkeeper robot. These real-time experiments are complemented by simulation studies and comparisons with traditional RL. As a preliminary, we develop a general ER framework that can be combined with essentially any incremental RL technique, and instantiate this framework for the approximate Q-learning and SARSA algorithms. The successful real-time learning results that are presented here are highly encouraging for the applicability of ER RL in practice.

机译：强化学习（RL）算法可以为非线性的，可能是随机的系统自动学习最佳控制策略。 RL控制的一种有前途的方法是体验重播（ER），它可以通过将这些数据重复呈现给底层RL算法来从有限的数据中快速学习。尽管具有ER RL的优点，但文献中仅对ER RL进行了零星研究，其应用很大程度上局限于模拟系统。因此，在本文中，我们在涉及摆摆问题和守门员机器人基于视觉的控制的实时控制实验中评估了ER RL。这些实时实验辅以模拟研究以及与传统RL的比较。首先，我们开发了一个通用的ER框架，该框架可以与基本上任何增量RL技术结合使用，并为近似Q学习和SARSA算法实例化此框架。此处介绍的成功的实时学习结果对于ER RL在实践中的适用性非常令人鼓舞。

著录项

来源
《Systems, Man, and Cybernetics, Part C: Applications and Reviews, IEEE Transactions on》 |2012年第2期|p.201-212|共12页
作者
Adam S.; Busoniu L.; Babuska R.;
展开▼
作者单位

Large Corporates and Merchant Banking Division, ABN AMRO Bank, The Netherlands;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Experience replay (ER); Q-learning; SARSA; real-time control; reinforcement learning (RL); robotics;

机译：体验重播（ER）;Q学习;SARSA;实时控制;强化学习（RL）;机器人;

相似文献

外文文献
中文文献
专利

1. Real-time reinforcement learning by sequential Actor-Critics and experience replay. [J] . Wawrzynski P Neural Networks: The Official Journal of the International Neural Network Society . 2009,第10期

机译：通过连续的Actor-Critics进行实时强化学习，并体验回放。
2. Integral reinforcement learning and experience replay for adaptive optimal control of partially-unknown constrained-input continuous-time systems [J] . Hamidreza Modares, Frank L. Lewis, Mohammad-Bagher Naghibi-Sistani Automatica . 2014,第1期

机译：整体强化学习和经验重播，用于部分未知约束输入连续时间系统的自适应最优控制
3. Forgetful experience replay in hierarchical reinforcement learning from expert demonstrations [J] . Skrynnik Alexey, Staroverov Aleksey, Aitygulov Ermek, Knowledge-Based Systems . 2021,第Apra22期

机译：从专家演示中的分层强化学习中的健忘体验重播
4. Continuous Value Iteration (CVI) Reinforcement Learning and Imaginary Experience Replay (IER) For Learning Multi-Goal, Continuous Action and State Space Controllers [C] . Andreas Gerken, Michael Spranger International Conference on Robotics and Automation . 2019

机译：用于学习多目标，连续动作和状态空间控制器的连续值迭代（CVI）强化学习和虚幻体验重放（IER）
5. Entropy-Based Experience Replay in Reinforcement Learning [D] . Dadvar, Mehdi. 2020

机译：基于熵的体验重播在加固学习中
6. Path Planning for Multi-Arm Manipulators Using Deep Reinforcement Learning: Soft Actor–Critic with Hindsight Experience Replay [O] . Evan Prianto, MyeongSeop Kim, Jae-Han Park, 2020

机译：使用深度加强学习的多臂操纵器的路径规划：软演员 - 与后敏感体验重播
7. Hindsight Experience Replay Improves Reinforcement Learning for Control of a MIMO Musculoskeletal Model of the Human Arm [O] . Douglas C. Crowder, Jessica Abreu, Robert F. Kirsch 2021

机译：Hindsight体验重播改善了控制人类手臂MIMO肌肉骨骼模型的加固学习
8. Enhanced Experience Replay for Deep Reinforcement Learning. [R] . Doria, D., Dawson, B., Vindiola, M. 2015

机译：增强深度强化学习的体验重播。

Experience Replay for Real-Time Reinforcement Learning Control

摘要

著录项

相似文献

相关主题

期刊订阅