首页> 外文会议>IEEE Conference on Computational Intelligence and Games >Learning macromanagement in starcraft from replays using deep learning
【24h】

Learning macromanagement in starcraft from replays using deep learning

机译:在使用深度学习的重播中学习星际争的宏观管理

获取原文

摘要

The real-time strategy game StarCraft has proven to be a challenging environment for artificial intelligence techniques, and as a result, current state-of-the-art solutions consist of numerous hand-crafted modules. In this paper, we show how macromanagement decisions in StarCraft can be learned directly from game replays using deep learning. Neural networks are trained on 789,571 state-action pairs extracted from 2,005 replays of highly skilled players, achieving top-1 and top-3 error rates of 54.6% and 22.9% in predicting the next build action. By integrating the trained network into UAlbertaBot, an open source StarCraft bot, the system can significantly outperform the game's built-in Terran bot, and play competitively against UAlbertaBot with a fixed rush strategy. To our knowledge, this is the first time macromanagement tasks are learned directly from replays in StarCraft. While the best hand-crafted strategies are still the state-of-the-art, the deep network approach is able to express a wide range of different strategies and thus improving the network's performance further with deep reinforcement learning is an immediately promising avenue for future research. Ultimately this approach could lead to strong StarCraft bots that are less reliant on hard-coded strategies.
机译:实时策略游戏星际争霸已被证明是人工智能技术的具有挑战性的环境,因此,目前最先进的解决方案包括许多手工制作的模块。在本文中,我们展示了星际争霸中的宏互换决策如何直接从使用深度学习的游戏重放来学习。神经网络在789,571状态 - 动作对中培训,从高技能播放器的2,005个重播中提取,在预测下一个构建动作中,实现了54.6 %和22.9 %的前1个和前3个错误率。通过将培训的网络集成到Ualbertabot,一个开源星形争霸BOT,系统可以显着优于游戏的内置地机器人,并竞争地对抗Ualbertabot以固定的匆忙策略。为了我们的知识,这是第一次直接从星际争霸中的重播中学到的宏观管理任务。虽然最好的手工制作的策略仍然是最先进的,但深度网络方法能够表达各种不同的策略,从而提高网络的性能,进一步加强深度加强学习是一个立即承诺的未来的大道研究。最终,这种方法可能导致强烈的星际争霸机器人依赖于硬编码策略。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号