首页> 外文会议>IEEE Symposium Series on Computational Intelligence >Efficient Practice for Deep Reinforcement Learning
【24h】

Efficient Practice for Deep Reinforcement Learning

机译:深度强化学习的有效实践

获取原文

摘要

Deep reinforcement learning has demonstrated its ability to solve diverse hard problems, which were not able to be solved previously. However, one of the main drawbacks is the required long training time for a task. Furthermore, trial-and-error in reinforcement learning is often inappropriate in most real-world tasks. There were some discussions on transfer learning, specifically sim-to-real transfer, but the design of new environments or defining an environment with similar or relevant goals require tedious human efforts. This research aims to leverage practice approaches, which do not require a new environmental design for transfer learning, to shorten the time for training a deep reinforcement learning agent and to achieve an optimal policy for the maximum expected rewards. We verify the efficacy of existing practice approach with Deep Q networks (DQN) in Atari games and also introduce a novel practice approach involving iterations of short periods of practice and reinforcement learning, to further improve the performance of an agent.
机译:深度强化学习证明了其解决各种以前无法解决的难题的能力。但是,主要缺点之一是任务需要很长的训练时间。此外,在大多数实际任务中,强化学习中的反复试验通常是不合适的。关于转移学习,特别是从模拟到真实的转移,进行了一些讨论,但是新环境的设计或定义具有相似或相关目标的环境需要繁琐的人工工作。这项研究旨在利用实践方法,该方法不需要为迁移学习设计新的环境,从而缩短了培训深度强化学习代理的时间,并实现了获得最大预期收益的最佳策略。我们使用Atari游戏中的Deep Q网络(DQN)验证了现有练习方法的有效性,并且还引入了一种新颖的练习方法,该方法涉及短期练习的迭代和强化学习,以进一步提高代理的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号