Back to Basics: Benchmarking Canonical Evolution Strategies for Playing Atari

机译：回到基础知识：基准展示Atari的典型演变策略

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Evolution Strategies (ES) have recently been demonstrated to be a viable alternative to reinforcement learning (RL) algorithms on a set of challenging deep RL problems, including Atari games and Mu-JoCo humanoid locomotion benchmarks. While the ES algorithms in that work belonged to the specialized class of natural evolution strategies (which resemble approximate gradient RL algorithms, such as REINFORCE), we demonstrate that even a very basic canonical ES algorithm can achieve the same or even better performance. This success of a basic ES algorithm suggests that the state-of-the-art can be advanced further by integrating the many advances made in the field of ES in the last decades. We also demonstrate qualitatively that ES algorithms have very different performance characteristics than traditional RL algorithms: on some games, they learn to exploit the environment and perform much better while on others they can get stuck in suboptimal local minima. Combining their strengths with those of traditional RL algorithms is therefore likely to lead to new advances in the state of the art.

机译：进化策略（ES）最近已被证明是一种可行的替代强化学习（RL）算法上的一组挑战深RL的问题，包括雅达利游戏和Mu-JOCO人形运动基准。虽然在工作中的ES算法属于专业类的自然进化策略（这类似于近似梯度RL算法，如加固），我们证明了即使是非常相同甚至更好的性能基本规范ES算法可以实现。一个基本的ES算法的这种成功表明，国家的最先进的可先进进一步通过整合在ES领域在过去几十年取得了许多进展。我们还演示了定性是ES算法比传统算法RL非常不同的性能特点：在一些游戏，他们学会利用这个环境下更好地执行，而对他人，他们可能会卡在次优的局部极小。与传统的RL算法结合自己的优势因此可能导致在现有技术的新进展。

著录项

来源
《International Joint Conference on Artificial Intelligence》|2018年|698-1441p|共8页
会议地点
作者
Patryk Chrabaszcz; Ilya Loshchilov; Frank Hutter;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP18-53;
关键词

相似文献

外文文献
中文文献
专利

1. Canonical transient receptor potential 1 plays a role in basic fibroblast growth factor (bFGF)/FGF receptor-1-induced Ca2+ entry and embryonic rat neural stem cell proliferation. [J] . Fiorio Pla A, Maric D, Brazer SC, The Journal of Neuroscience: The Official Journal of the Society for Neuroscience . 2005,第10期

机译：典范的瞬态受体电位1在碱性成纤维细胞生长因子（bFGF）/ FGF受体1诱导的Ca2 +进入和胚胎大鼠神经干细胞增殖中起作用。
2. The Benchmarking Strategy Has a Role to Play Across Cultures [J] . Spilka Michael J., Dobson Keith S. Clinical psychology : . 2015,第1期

机译：标杆策略在跨文化中发挥作用
3. Playing on a Pathogen's Weakness: Using Evolution to Guide Sustainable Plant Disease Control Strategies [J] . Zhan Jiasui, Thrall Peter H., Papaix Julien, Annual Review of Phytopathology . 2015,第Null期

机译：发挥病原体的弱点：利用进化来指导可持续的植物病害控制策略
4. Back to Basics: Benchmarking Canonical Evolution Strategies for Playing Atari [C] . Patryk Chrabaszcz, Ilya Loshchilov, Frank Hutter International Joint Conference on Artificial Intelligence . 2018

机译：回到基础知识：基准展示Atari的典型演变策略
5. Comparing particle swarms and evolution strategies: Benchmarks and application. [D] . Khemka, Namrata. 2006

机译：比较粒子群和进化策略：基准和应用。
6. Canonical Transient Receptor Potential 1 Plays a Role in Basic Fibroblast Growth Factor (bFGF)/FGF Receptor-1-Induced Ca2+ Entry and Embryonic Rat Neural Stem Cell Proliferation [O] . Alessandra Fiorio Pla, Dragan Maric, So-Ching Brazer, 2005

机译：典型的瞬态受体电位1在碱性成纤维细胞生长因子（bFGF）/ FGF受体1诱导的Ca 2+进入和胚胎大鼠神经干细胞增殖中起作用。
7. Back to Basics: Benchmarking Canonical Evolution Strategies for Playing Atari [O] . Patryk Chrabąszcz, Ilya Loshchilov, Frank Hutter 2018

机译：回到基础知识：基准展示Atari的典型演变策略

Back to Basics: Benchmarking Canonical Evolution Strategies for Playing Atari

摘要

著录项

相似文献

相关主题

期刊订阅