首页> 外文OA文献 >Learning macromanagement in starcraft from replays using deep learning
【2h】

Learning macromanagement in starcraft from replays using deep learning

机译:使用深度学习从重播中学习星际争霸中的宏管理

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

The real-time strategy game StarCraft has proven to be a challenging environment for artificial intelligence techniques, and as a result, current state-of-the-art solutions consist of numerous hand-crafted modules. In this paper, we show how macromanagement decisions in StarCraft can be learned directly from game replays using deep learning. Neural networks are trained on 789,571 state-action pairs extracted from 2,005 replays of highly skilled players, achieving top-1 and top-3 error rates of 54.6% and 22.9% in predicting the next build action. By integrating the trained network into UAlbertaBot, an open source StarCraft bot, the system can significantly outperform the game’s built-in Terran bot, and play competitively against UAlbertaBot with a fixed rush strategy. To our knowledge, this is the first time macromanagement tasks are learned directly from replays in StarCraft. While the best hand-crafted strategies are still the state-of-the-art, the deep network approach is able to express a wide range of different strategies and thus improving the network’s performance further with deep reinforcement learning is an immediately promising avenue for future research. Ultimately this approach could lead to strong StarCraft bots that are less reliant on hard-coded strategies.
机译:实时战略游戏《星际争霸》已经证明对人工智能技术而言是充满挑战的环境,因此,当前的最新解决方案由众多手工制作的模块组成。在本文中,我们展示了如何使用深度学习直接从游戏重播中学习《星际争霸》中的宏管理决策。神经网络接受了从高技能玩家的2,005次重播中提取的789,571个状态动作对的训练,在预测下一个构建动作时,前1位和前3位错误率分别为54.6%和22.9%。通过将训练有素的网络集成到开源StarCraft机器人UAlbertaBot中,该系统可以大大胜过游戏的内置Terran机器人,并可以采用固定的抢冲策略与UAlbertaBot竞争。据我们所知,这是第一次从StarCraft中的重放直接学习宏管理任务。尽管最佳的手工制定策略仍是最新技术,但深度网络方法能够表达各种不同的策略,因此,通过深度强化学习进一步改善网络性能是未来的直接希望之路。研究。最终,这种方法可能会导致强大的StarCraft机器人更少依赖硬编码策略。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号