IEEE International Conference on Autonomous Robot Systems and Competitions

Competitive Deep Reinforcement Learning over a Pokémon Battling Simulator


Abstract

Pokémon is one of the most popular video games in the world, and interest in Pokémon battling as a testbed for AI challenges has recently emerged. This interest stems from the fact that Pokémon battling exhibits properties that contrast with those of current AI challenges in other video games. To this end, we implement a Pokémon Battle Environment, which preserves many of the core elements of Pokémon battling and allows researchers to test isolated learning objectives. Our approach focuses on type advantage in Pokémon battles and on the delayed rewards obtained through switching, which are considered core strategies in any Pokémon battle. As a competitive multi-agent environment, it has a partially observable, high-dimensional, and continuous state space, adheres to the de facto standard Gym reinforcement learning interface, and is performance-oriented, achieving thousands of interactions per second on commodity hardware. We investigate whether the deep competitive reinforcement learning algorithms WPLθ and GIGAθ can learn successful policies in this environment. Both converge to rational and effective strategies, and GIGAθ shows faster convergence, obtaining a 100% win rate in a disadvantageous test scenario.
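Since the abstract states that the environment adheres to the Gym interface, the interaction loop would look like the minimal sketch below. The package name, environment id, and the random-action stand-in for a learned policy are illustrative assumptions, not the paper's actual API; only the reset/step/action_space protocol is the standard Gym interface the abstract refers to.

```python
# Minimal sketch of an agent-environment loop, assuming the paper's
# Pokémon Battle Environment is exposed through the classic Gym API.
# `pokemon_battle_env` and "PokemonBattle-v0" are hypothetical names.
import gym
import pokemon_battle_env  # hypothetical package that registers the env

env = gym.make("PokemonBattle-v0")  # assumed environment id

for episode in range(10):
    obs = env.reset()   # partially observable battle state
    done = False
    episode_return = 0.0
    while not done:
        # A trained policy (e.g. WPLθ or GIGAθ) would map `obs` to a
        # move or a switch here; random sampling stands in for it.
        action = env.action_space.sample()
        obs, reward, done, info = env.step(action)
        episode_return += reward
    print(f"episode {episode}: return {episode_return}")
```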

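WPLθ and GIGAθ are not defined on this page; presumably they are θ-parameterized (neural-network) variants of the tabular GIGA (Zinkevich, 2003) and WPL (Abdallah and Lesser, 2008) gradient learners from the multi-agent learning literature. For reference, the classic tabular update rules are:

```latex
% Classic (tabular) update rules, for reference; the θ variants in the
% paper presumably replace the explicit policy vector with network weights.

% GIGA: unconstrained gradient ascent on the policy, followed by a
% projection P(.) back onto the probability simplex.
\[
  \pi_{t+1} = P\!\big(\pi_t + \eta_t \,\nabla_{\pi} V_t(\pi_t)\big)
\]

% WPL: the gradient for each action a is weighted so that updates slow
% down as the policy approaches the simplex boundary.
\[
  \Delta \pi(a) = \eta \,\frac{\partial V}{\partial \pi(a)} \cdot
  \begin{cases}
    \pi(a)     & \text{if } \partial V / \partial \pi(a) < 0,\\
    1 - \pi(a) & \text{otherwise.}
  \end{cases}
\]
```

The boundary-sensitive weighting is what gives WPL its convergent dynamics in competitive games, which is consistent with the abstract's report that both learners converge to rational strategies.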
