【24h】

JamesBot - an intelligent agent playing StarCraft II

机译:JamesBot-玩《星际争霸2》的智能代理

获取原文

摘要

The most popular method for optimizing a certain strategy based on a reward is Reinforcement Learning (RL). Lately, a big challenge for this technique are computer games such as StarCraft II which is a real-time strategy game, created by Blizzard. The main idea of this game is to fight between agents and control objects on the battlefield in order to defeat the enemy. This work concerns creating an autonomous bot using reinforced learning, in particular, the Q-Learning algorithm for playing StarCraft. JamesBot consists of three parts. State Manager processes relevant information from the environment. Decision Manager consists of a table implementation of the Q-Learning algorithm, which assigns actions to states, and the epsilon-greedy strategy, which determines the behavior of the bot. In turn, Action Manager is responsible for executing commands. Testing bots involves fighting the default (simple) agent built into the game. Although JamesBot played better than the default (random) agent, it failed to gain the ability to defeat the opponent. The obtained results, however, are quite promising in terms of the possibilities of further development.
机译:基于奖励优化特定策略的最流行方法是强化学习(RL)。最近,这种技术面临的一大挑战是诸如暴雪开发的实时战略游戏《星际争霸2》之类的计算机游戏。该游戏的主要思想是在战场上的特工与控制对象之间进行战斗,以击败敌人。这项工作涉及使用强化学习(尤其是玩StarCraft的Q-Learning算法)创建自主机器人。 JamesBot包括三个部分。状态管理器处理来自环境的相关信息。决策管理器由Q-Learning算法的表实现(将动作分配给状态)和epsilon-greedy策略(确定机器人的行为)组成。反过来,动作管理器负责执行命令。测试机器人需要与游戏中内置的默认(简单)代理进行战斗。尽管JamesBot的表现要优于默认(随机)特工,但未能获得击败对手的能力。然而,就进一步发展的可能性而言,所获得的结果是很有希望的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号