IEEE Computational Intelligence Magazine

Improving RTS Game AI by Supervised Policy Learning, Tactical Search, and Deep Reinforcement Learning

Abstract

Constructing strong AI systems for video games is difficult due to enormous state and action spaces and the lack of good state evaluation functions and high-level action abstractions. In spite of recent research progress in popular video game genres such as Atari 2600 console games and multiplayer online battle arena (MOBA) games, to this day strong human players can still defeat the best AI systems in adversarial video games. In this paper, we propose to use a deep Convolutional Neural Network (CNN) to select among a limited set of abstract action choices in Real-Time Strategy (RTS) games, and to utilize the remaining computation time for game tree search to improve low-level tactics. The CNN is trained by supervised learning on game states labeled by Puppet Search, a strategic search algorithm that uses action abstractions. Replacing Puppet Search by a CNN frees up time that can be used for improving units' tactical behavior while executing the strategic plan. Experiments in the μRTS game show that the combined algorithm results in higher win-rates than either of its two independent components and other state-of-the-art μRTS agents. We then present a case-study that investigates how deep Reinforcement Learning (RL) can be used in modern video games, such as Total War: Warhammer, to improve tactical multi-agent AI modules. We use popular RL algorithms such as Deep Q-Networks (DQN) and Asynchronous Advantage Actor-Critic (A3C), basic network architectures, and minimal hyper-parameter tuning to learn complex cooperative behaviors that defeat the highest-difficulty built-in AI in medium-scale scenarios.
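As a hedged illustration of the supervised policy-learning step described in the abstract, the sketch below trains a classifier on game states labeled with the abstract action a strategic searcher (Puppet Search in the paper) would choose. A linear softmax model stands in for the paper's CNN, and the states, features, and labels are synthetic placeholders, not μRTS data.

```python
import numpy as np

# Hypothetical setup: fake state features and labels that a strategic
# searcher might emit; the real system labels μRTS states with Puppet Search.
rng = np.random.default_rng(0)
N_STATES, N_FEATURES, N_ACTIONS = 500, 16, 3

X = rng.normal(size=(N_STATES, N_FEATURES))        # synthetic state features
true_W = rng.normal(size=(N_FEATURES, N_ACTIONS))
y = np.argmax(X @ true_W, axis=1)                  # searcher-style labels

def softmax(z):
    z = z - z.max(axis=1, keepdims=True)           # numerical stability
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)

# Train by minimizing cross-entropy with plain gradient descent.
W = np.zeros((N_FEATURES, N_ACTIONS))
for _ in range(500):
    p = softmax(X @ W)
    grad = X.T @ (p - np.eye(N_ACTIONS)[y]) / N_STATES
    W -= 0.5 * grad

acc = np.mean(np.argmax(X @ W, axis=1) == y)       # training accuracy
```

Once such a learned policy replaces the search at the strategic level, the saved computation budget can be spent on tactical game-tree search, which is the division of labor the paper proposes.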
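The RL case-study rests on value-based methods such as DQN. As a hedged sketch, the snippet below shows the one-step Q-learning update at DQN's core, in tabular form on a toy five-state chain invented for illustration; the paper instead applies deep function approximators (DQN, A3C) to multi-agent Total War: Warhammer battles.

```python
import numpy as np

# Toy chain: states 0..4, actions left/right, reward on reaching state 4.
rng = np.random.default_rng(1)
N, GOAL = 5, 4
Q = np.zeros((N, 2))                   # Q[s, a]; action 0 = left, 1 = right
alpha, gamma = 0.5, 0.9

# Q-learning is off-policy, so uniform random exploration suffices here.
for _ in range(2000):
    s = rng.integers(GOAL)             # sample a non-terminal state
    a = rng.integers(2)                # sample an action
    s2 = max(0, s - 1) if a == 0 else s + 1
    r = 1.0 if s2 == GOAL else 0.0
    # One-step target: r + gamma * max_a' Q(s', a'), no bootstrap at terminal.
    target = r if s2 == GOAL else r + gamma * np.max(Q[s2])
    Q[s, a] += alpha * (target - Q[s, a])

policy = np.argmax(Q, axis=1)          # greedy policy: move right toward goal
```

DQN replaces the table with a neural network and stabilizes the same update with experience replay and a target network; A3C instead learns a policy and value function directly from parallel actors.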
