首页> 外文会议>International Joint Conference on Artificial Intelligence >Back to Basics: Benchmarking Canonical Evolution Strategies for Playing Atari
【24h】

Back to Basics: Benchmarking Canonical Evolution Strategies for Playing Atari

机译:回到基础知识:基准展示Atari的典型演变策略

获取原文

摘要

Evolution Strategies (ES) have recently been demonstrated to be a viable alternative to reinforcement learning (RL) algorithms on a set of challenging deep RL problems, including Atari games and Mu-JoCo humanoid locomotion benchmarks. While the ES algorithms in that work belonged to the specialized class of natural evolution strategies (which resemble approximate gradient RL algorithms, such as REINFORCE), we demonstrate that even a very basic canonical ES algorithm can achieve the same or even better performance. This success of a basic ES algorithm suggests that the state-of-the-art can be advanced further by integrating the many advances made in the field of ES in the last decades. We also demonstrate qualitatively that ES algorithms have very different performance characteristics than traditional RL algorithms: on some games, they learn to exploit the environment and perform much better while on others they can get stuck in suboptimal local minima. Combining their strengths with those of traditional RL algorithms is therefore likely to lead to new advances in the state of the art.
机译:进化策略(ES)最近已被证明是一种可行的替代强化学习(RL)算法上的一组挑战深RL的问题,包括雅达利游戏和Mu-JOCO人形运动基准。虽然在工作中的ES算法属于专业类的自然进化策略(这类似于近似梯度RL算法,如加固),我们证明了即使是非常相同甚至更好的性能基本规范ES算法可以实现。一个基本的ES算法的这种成功表明,国家的最先进的可先进进一步通过整合在ES领域在过去几十年取得了许多进展。我们还演示了定性是ES算法比传统算法RL非常不同的性能特点:在一些游戏,他们学会利用这个环境下更好地执行,而对他人,他们可能会卡在次优的局部极小。与传统的RL算法结合自己的优势因此可能导致在现有技术的新进展。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号