首页> 外文OA文献 >Learning macromanagement in starcraft from replays using deep learning

【2h】

Learning macromanagement in starcraft from replays using deep learning

机译：使用深度学习从重播中学习星际争霸中的宏管理

代理获取

本网站仅为用户提供外文OA文献查询和代理获取服务，本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文，但由于OA文献来源多样且变更频繁，仍可能出现获取不到、文献不完整或与标题不符等情况，如果获取不到我们将提供退款服务。请知悉。

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

The real-time strategy game StarCraft has proven to be a challenging environment for artificial intelligence techniques, and as a result, current state-of-the-art solutions consist of numerous hand-crafted modules. In this paper, we show how macromanagement decisions in StarCraft can be learned directly from game replays using deep learning. Neural networks are trained on 789,571 state-action pairs extracted from 2,005 replays of highly skilled players, achieving top-1 and top-3 error rates of 54.6% and 22.9% in predicting the next build action. By integrating the trained network into UAlbertaBot, an open source StarCraft bot, the system can significantly outperform the game’s built-in Terran bot, and play competitively against UAlbertaBot with a fixed rush strategy. To our knowledge, this is the first time macromanagement tasks are learned directly from replays in StarCraft. While the best hand-crafted strategies are still the state-of-the-art, the deep network approach is able to express a wide range of different strategies and thus improving the network’s performance further with deep reinforcement learning is an immediately promising avenue for future research. Ultimately this approach could lead to strong StarCraft bots that are less reliant on hard-coded strategies.

机译：实时战略游戏《星际争霸》已经证明对人工智能技术而言是充满挑战的环境，因此，当前的最新解决方案由众多手工制作的模块组成。在本文中，我们展示了如何使用深度学习直接从游戏重播中学习《星际争霸》中的宏管理决策。神经网络接受了从高技能玩家的2,005次重播中提取的789,571个状态动作对的训练，在预测下一个构建动作时，前1位和前3位错误率分别为54.6％和22.9％。通过将训练有素的网络集成到开源StarCraft机器人UAlbertaBot中，该系统可以大大胜过游戏的内置Terran机器人，并可以采用固定的抢冲策略与UAlbertaBot竞争。据我们所知，这是第一次从StarCraft中的重放直接学习宏管理任务。尽管最佳的手工制定策略仍是最新技术，但深度网络方法能够表达各种不同的策略，因此，通过深度强化学习进一步改善网络性能是未来的直接希望之路。研究。最终，这种方法可能会导致强大的StarCraft机器人更少依赖硬编码策略。

著录项

作者
Justesen Niels; Risi Sebastian;
展开▼
作者单位

展开▼
年度 2017
总页数
原文格式 PDF
正文语种 eng
中图分类

相似文献

外文文献
中文文献
专利

1. Predicting combat outcomes and optimizing armies in StarCraft Ⅱ by deep learning [J] . Lee Donghyeon, Kim Man-Je, Ahn Chang Wook Expert systems with applications . 2021,第Deca期

机译：深入学习预测战斗成果和明星争霸Ⅱ的优化军队
2. Motion predictive control for DPS using predicted drifted ship position based on deep learning and replay buffer [J] . Daesoo Lee, Seung Jae Lee International Journal of Naval Architecture and Ocean Engineering . 2020,第1期

机译：基于深度学习和重放缓冲区的预测漂移船位置的DPS运动预测控制
3. Elastic Cloud Logs Traces, Storing and Replaying for Deep Machine Learning [J] . Tariq Daradkeh, Anjali Agarwal, Nishith Goel, Procedia Computer Science . 2020,第5期

机译：弹性云日志痕迹，存储和重放深层机器学习
4. Learning macromanagement in starcraft from replays using deep learning [C] . Niels Justesen, Sebastian Risi IEEE Conference on Computational Intelligence and Games . 2017

机译：使用深度学习从重播中学习星际争霸中的宏观管理
5. Deep Learning-Based Localisation for Autonomous Vehicles =Deep Learning-basierte Lokalisierung für autonome Fahrzeuge [D] . Carrillo Mendoza, Ricardo. 2021

机译：基于深度学习的自主车辆本地化=自动车辆的深度学习本地化
6. Path Planning for Multi-Arm Manipulators Using Deep Reinforcement Learning: Soft Actor–Critic with Hindsight Experience Replay [O] . Evan Prianto, MyeongSeop Kim, Jae-Han Park, 2020

机译：使用深度加强学习的多臂操纵器的路径规划：软演员 - 与后敏感体验重播
7. Learning Macromanagement in StarCraft from Replays using Deep Learning [O] . Justesen, Niels, Risi, Sebastian 2017

机译：使用深度学习从重播中学习星际争霸中的宏观管理

Learning macromanagement in starcraft from replays using deep learning

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅