IEEE International Conference on Electro Information Technology

Deep Deterministic Policy Gradients with Transfer Learning Framework in StarCraft Micromanagement



Abstract

This paper proposes an intelligent multi-agent approach for micromanagement in the real-time strategy game StarCraft, based on deep deterministic policy gradient (DDPG) techniques. An actor network and a critic network are established to estimate the optimal control actions and the corresponding value functions, respectively. A special reward function is designed from the agents' own condition and the enemies' information to help the agents exercise intelligent control in the game. Furthermore, to accelerate the learning process, transfer learning techniques are integrated into the training. Specifically, the agents are first trained on a simple task to learn basic combat concepts, such as detouring, evading, and joint attacking; this experience is then transferred to a target task with a more complex and difficult scenario. The experiments show that the proposed algorithm with transfer learning achieves better performance.
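The actor-critic structure and the transfer step described in the abstract can be sketched as follows. This is a minimal illustrative sketch, not the paper's implementation: the state and action dimensions, layer sizes, and the shaped-reward form are all assumptions, and the transfer step is shown only as initializing the target-task networks from the source-task weights.

```python
import numpy as np

# Assumed dimensions for illustration only.
STATE_DIM = 4    # e.g. own HP, enemy HP, relative x, relative y (hypothetical)
ACTION_DIM = 2   # e.g. a continuous movement direction (hypothetical)

rng = np.random.default_rng(0)

def make_net(in_dim, out_dim, hidden=16):
    """One-hidden-layer network represented as a dict of weight matrices."""
    return {
        "W1": rng.normal(0.0, 0.1, (in_dim, hidden)),
        "W2": rng.normal(0.0, 0.1, (hidden, out_dim)),
    }

def forward(net, x):
    """Forward pass with a tanh hidden nonlinearity."""
    h = np.tanh(x @ net["W1"])
    return h @ net["W2"]

# Actor: state -> deterministic action. Critic: (state, action) -> Q-value.
actor = make_net(STATE_DIM, ACTION_DIM)
critic = make_net(STATE_DIM + ACTION_DIM, 1)

def act(state):
    # tanh bounds the continuous action, as is typical in DDPG.
    return np.tanh(forward(actor, state))

def q_value(state, action):
    return forward(critic, np.concatenate([state, action]))[0]

def shaped_reward(own_hp, enemy_hp, damage_dealt, damage_taken):
    """Hypothetical reward built from the agent's own condition and enemy
    information, in the spirit of the abstract's reward design."""
    return damage_dealt - damage_taken + 0.1 * (own_hp - enemy_hp)

def transfer(src_net):
    """Transfer step: initialize a target-task network from the weights
    learned on the simple source task."""
    return {k: v.copy() for k, v in src_net.items()}

# Pretend `actor` was trained on the simple task; reuse it for the hard task.
target_task_actor = transfer(actor)

state = rng.normal(size=STATE_DIM)
action = act(state)
q = q_value(state, action)
```

In the full DDPG algorithm the critic would be trained by temporal-difference updates against target networks and the actor by the deterministic policy gradient; the sketch above shows only the network roles and the weight-transfer idea.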
