...
首页> 外文期刊>World Wide Web >Accelerated deep reinforcement learning with efficient demonstration utilization techniques
【24h】

Accelerated deep reinforcement learning with efficient demonstration utilization techniques

机译:高效示范利用技术加速了深度加强学习

获取原文
获取原文并翻译 | 示例
           

摘要

The use of demonstrations for deep reinforcement learning (RL) agents usually accelerates their training process as well as guides the agents to learn complicated policies. Most of the current deep RL approaches with demonstrations assume that there is a sufficient amount of high-quality demonstrations. However, for most real-world learning cases, the available demonstrations are often limited in terms of amount and quality. In this paper, we present an accelerated deep RL approach with dual replay buffer management and dynamic frame skipping on demonstrations. The dual replay buffer manager manages a human replay buffer and an actor replay buffer with independent sampling policies. We also propose dynamic frame skipping on demonstrations called DFS-ER (Dynamic Frame Skipping-Experience Replay) that learns the action repetition factor of the demonstrations. By implementing DFS-ER, we can accelerate deep RL by improving the efficiency of demonstration utilization, thereby yielding a faster exploration of the environment. We verified the training acceleration in three dense reward environments and one sparse reward environment compared to the conventional approach. In our evaluation using the Atari game environments, the proposed approach showed 21.7%-39.1% reduction in training iterations in a sparse reward environment.
机译:用于深度加强学习(RL)代理商的使用示范通常会加速其培训过程,并指导代理人学习复杂的政策。最目前的深入RL与演示的方法认为有足够的高质量示威。但是,对于大多数现实世界的学习案例,可用的演示通常在数量和质量方面有限。在本文中,我们介绍了一种加速的深度RL方法,双重重放缓冲管理和动态帧跳过演示。双重重放缓冲区管理器管理人类重放缓冲区和具有独立采样策略的演员重放缓冲区。我们还提出动态帧跳过显示DFS-er(动态帧跳过体验重放)的演示,了解示范的动作重复因子。通过实施DFS-er,我们可以通过提高示范利用效率来加速深度RL,从而产生更快的环境探索。与传统方法相比,我们验证了三个密集奖励环境中的培训加速度和一个稀疏的奖励环境。在我们使用Atari游戏环境的评估中,在稀疏奖励环境中培训迭代减少了21.7%-39.1%-39.1%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号