IEEE International Conference on Electro Information Technology

Deep Deterministic Policy Gradients with Transfer Learning Framework in StarCraft Micromanagement



Abstract

This paper proposes an intelligent multi-agent approach for the real-time strategy game StarCraft based on deep deterministic policy gradients (DDPG). An actor network and a critic network are established to estimate the optimal control actions and the corresponding value functions, respectively. A special reward function is designed based on the agents' own condition and information about the enemies, helping the agents make intelligent control decisions in the game. Furthermore, to accelerate the learning process, transfer learning techniques are integrated into training. Specifically, the agents are first trained on a simple task to learn basic combat skills such as detour movement, evasion, and joint attack. This experience is then transferred to the target task, a more complex and difficult scenario. Experiments show that the proposed algorithm with transfer learning achieves better performance.
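To make the actor-critic structure described in the abstract concrete, the following is a minimal sketch of the DDPG building blocks: an actor mapping states to bounded actions, a critic mapping state-action pairs to Q-values, and soft-updated target copies of both. All network sizes, the soft-update rate `tau`, and the state/action dimensions are illustrative assumptions, not values from the paper.

```python
import numpy as np

rng = np.random.default_rng(0)

def init_net(sizes):
    # One weight matrix per layer; biases omitted for brevity.
    return [rng.normal(0, 0.1, (m, n)) for m, n in zip(sizes[:-1], sizes[1:])]

def forward(net, x):
    for W in net[:-1]:
        x = np.tanh(x @ W)          # hidden-layer activations
    return x @ net[-1]              # linear output layer

state_dim, action_dim = 8, 2        # hypothetical observation/action sizes
actor = init_net([state_dim, 32, action_dim])        # state -> action
critic = init_net([state_dim + action_dim, 32, 1])   # (state, action) -> Q

# Target networks start as copies and slowly track the online networks,
# which is the stabilizing soft-update used by DDPG.
actor_target = [W.copy() for W in actor]
critic_target = [W.copy() for W in critic]

def soft_update(target, online, tau=0.01):
    for Wt, W in zip(target, online):
        Wt *= (1.0 - tau)
        Wt += tau * W

s = rng.normal(size=state_dim)
a = np.tanh(forward(actor, s))      # tanh keeps the control action bounded
q = forward(critic, np.concatenate([s, a]))
soft_update(actor_target, actor)
soft_update(critic_target, critic)
```

In a full training loop, the critic would be regressed toward `r + gamma * Q_target(s', actor_target(s'))` and the actor updated along the critic's action gradient; transfer would amount to initializing these weights from the networks trained on the simple source task.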


