Australasian Joint Conference on Artificial Intelligence

Designing Curriculum for Deep Reinforcement Learning in StarCraft II



Abstract

Reinforcement learning (RL) has proven successful in games, but suffers from long training times compared to other forms of machine learning. Curriculum learning, an optimisation technique that improves a model's ability to learn by presenting training samples in a meaningful order, known as a curriculum, could offer a solution. Curricula are usually designed manually, due to the limitations involved in automating curriculum generation. However, as there is little research into the effective design of curricula, researchers often rely on intuition, and the resulting performance can vary. In this paper, we explore different ways of manually designing curricula for RL in the real-time strategy game StarCraft II. We propose four generalised methods of manually creating curricula and verify their effectiveness through experiments. Our results show that all four of our proposed methods can improve an RL agent's learning process when used correctly. We demonstrate that using subtasks, or modifying the state space of the tasks, is the most effective way to create training samples for StarCraft II. We found that utilising subtasks during training consistently accelerated the learning process of the agent and improved the agent's final performance.
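To make the idea concrete, below is a minimal sketch of curriculum training over a sequence of StarCraft II mini-game subtasks of increasing difficulty. It is not the paper's implementation: the promotion thresholds and the `make_env`/`agent` interfaces are illustrative assumptions, while the mini-game names are real SC2LE subtasks. The agent is promoted to the next task once its rolling mean episode return clears the current task's threshold.

```python
from collections import deque

# Hypothetical curriculum: StarCraft II mini-game subtasks ordered from
# easy to hard, each paired with an assumed mean-return threshold that
# must be reached before the agent is promoted to the next task.
CURRICULUM = [
    ("MoveToBeacon",  25.0),   # basic unit control
    ("DefeatRoaches", 60.0),   # local combat micro
    ("BuildMarines",  80.0),   # economy and production
]

def train_with_curriculum(agent, make_env, episodes_per_task=1000, window=100):
    """Train `agent` on each subtask in order, promoting early once the
    rolling mean episode return clears the task's threshold.

    `make_env(name)` and the `agent.act`/`agent.observe` interface are
    placeholders, not APIs from the paper.
    """
    for task_name, threshold in CURRICULUM:
        env = make_env(task_name)
        recent = deque(maxlen=window)
        for _ in range(episodes_per_task):
            obs, done, episode_return = env.reset(), False, 0.0
            while not done:
                action = agent.act(obs)
                obs, reward, done = env.step(action)
                agent.observe(obs, reward, done)   # RL update hook
                episode_return += reward
            recent.append(episode_return)
            # Promote once performance on this subtask is good enough,
            # instead of always exhausting the episode budget.
            if len(recent) == window and sum(recent) / window >= threshold:
                break
    return agent
```

Ordering the subtasks this way is what distinguishes a curriculum from simply training on the final task: the agent carries the skills learned on easier tasks into the harder ones.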
