
Recency-Weighted Acceleration for Continuous Control Through Deep Reinforcement Learning

International Conference on Neural Information Processing

Abstract

Model-free reinforcement learning algorithms have been successfully applied to continuous control tasks. However, these algorithms suffer from severe instability and high sample complexity. Inspired by Averaged-DQN, this paper proposes a recency-weighted target estimator for actor-critic settings, which constructs the target by placing greater weight on recently learned value functions, yielding a more stable and accurate value estimate. In addition, policy updates are delayed under a more flexible control scheme to reduce the per-update error caused by value-function estimation errors. Furthermore, to improve the performance of prioritized experience replay (PER) on continuous control tasks, Phased-PER is proposed to accelerate training during different periods. Experimental results demonstrate that, with the same hyper-parameters and architecture, the proposed algorithm is more robust and achieves better performance, surpassing existing methods on a range of continuous control benchmark tasks.
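To make the recency-weighting concrete, below is a minimal Python sketch assuming an exponential weighting over the K most recently saved critic snapshots. The abstract does not specify the paper's exact weighting scheme, so the class name RecencyWeightedTarget, the decay parameter beta, and the snapshot mechanism are illustrative assumptions, not the authors' implementation.

from collections import deque

import numpy as np


class RecencyWeightedTarget:
    """Keeps the K most recent critic snapshots and averages their
    predictions, with geometrically larger weight on newer snapshots."""

    def __init__(self, k: int = 5, beta: float = 0.7):
        self.snapshots = deque(maxlen=k)  # newest snapshot appended last
        self.beta = beta                  # decay applied to older snapshots

    def push(self, critic_fn):
        """Store a frozen copy of the critic after a learning step."""
        self.snapshots.append(critic_fn)

    def q_target(self, state, action):
        """Recency-weighted average of the stored critics' Q estimates."""
        n = len(self.snapshots)
        # Snapshot of age a gets weight beta**a, so the newest (age 0)
        # contributes most; weights are normalized to sum to one.
        weights = np.array([self.beta ** (n - 1 - i) for i in range(n)])
        weights /= weights.sum()
        qs = np.array([q(state, action) for q in self.snapshots])
        return float(np.dot(weights, qs))

In a TD3-style actor-critic loop, the Bellman target r + gamma * q_target(s', pi(s')) would then use this weighted estimate in place of a single target network's prediction, in the same spirit as Averaged-DQN's averaging of previously learned Q-networks.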
