A Multiphase Semistatic Training Method for Swarm Confrontation Using Multiagent Deep Reinforcement Learning

He Cai; Yaoguo Luo; Huanli GaoJiale ChiShuozhe Wang

首页> 外文期刊>computational intelligence and neuroscience >A Multiphase Semistatic Training Method for Swarm Confrontation Using Multiagent Deep Reinforcement Learning

【24h】

A Multiphase Semistatic Training Method for Swarm Confrontation Using Multiagent Deep Reinforcement Learning

机译：一种基于多智能体深度强化学习的群体对抗多阶段半静态训练方法

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

AI期刊论文写作 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

In this paper, we propose a multiphase semistatic training method for swarm confrontation using multi-agent deep reinforcement learning. In particular, we build a swarm confrontation game, the 3V3 tank fight, based on the Unity platform and train the agents by a MDRL algorithm called MA-POCA, coming with the ML-Agent toolkit. By multiphase learning, we split the traditional single training phase into multiple consecutive training phases, where the performance level of the strong team for each phase increases in an incremental way. On the other hand, by semistatic learning, the strong team in all phases will stop learning when fighting against the weak team, which reduces the possibility that the weak team keeps being defeated and learns nothing at all. Comprehensive experiments prove that, in contrast to the traditional single-phase training method, the multiphase semistatic training method proposed in this paper can significantly increase the training efficiency, shedding lights on how the weak could learn from the strong with less time and computational cost.

机译：在本文中，我们提出了一种基于多智能体深度强化学习的群体对抗多阶段半静态训练方法。特别是，我们基于Unity平台构建了一个群体对抗游戏，即3V3坦克战，并通过称为MA-POCA的MDRL算法训练代理，该算法带有ML-Agent工具包。通过多阶段学习，我们将传统的单一训练阶段拆分为多个连续的训练阶段，每个阶段的强势团队的表现水平都会以增量方式提高。另一方面，通过半静态学习，强队在与弱队对战时，各个阶段都会停止学习，这降低了弱队不断被击败而什么也没学到的可能性。综合实验证明，与传统的单阶段训练方法相比，本文提出的多阶段半静态训练方法能够显著提高训练效率，揭示了弱者如何以更少的时间和计算成本向强者学习。

著录项

来源
《computational intelligence and neuroscience》 |2023年第7期|1-10|共10页
作者
He Cai; Yaoguo Luo; Huanli GaoJiale ChiShuozhe Wang;
展开▼
作者单位

School of Automation Science and Engineering, South China University of Technology;

展开▼
收录信息
原文格式 PDF
正文语种英语
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. A Bio-Inspired Decision-Making Method of UAV Swarm for Attack-Defense Confrontation via Multi-Agent Reinforcement Learning [J] . Chi Pei, Wei Jiahong, Wu KunDi BinWang Yingxun biomimetics . 2023,第2期

机译：基于多智能体强化学习的无人机群攻击防御对抗生物类决策方法
2. Autonomous Input Voltage Sharing Control and Triple Phase Shift Modulation Method for ISOP-DAB Converter in DC Microgrid: A Multiagent Deep Reinforcement Learning-Based Method [J] . Yu Zeng, Josep Pou, Changjiang SunSuvajit MukherjeeXu XuAmit Kumar GuptaJiaxin Dong IEEE Transactions on Power Electronics . 2023,第1期

机译：直流微电网ISOP-DAB变换器自主输入均压控制及三相移调制方法：一种基于多智能体深度强化学习的方法
3. Findings from Nanyang Technological University Reveals New Findings on Networks (A Multiagent Quantum Deep Reinforcement Learning Method for Distributed Frequency Control of Islanded Microgrids) [J] . Network Daily News . 2023,第13期

机译：南洋理工大学的研究成果揭示了网络的新发现（一种用于孤岛微电网分布式频率控制的多智能体量子深度强化学习方法）
4. Multiagent Reinforcement Learning for Swarm Confrontation Environments [C] . Guanyu Zhang, Yuan Li, Xinhai Xu, International conference on intelligent robotics and applications . 2019

机译：群对抗环境中的多主体强化学习
5. Robotic Swarm Control Using Deep Reinforcement Learning Strategies Based on Mean-Field Models [D] . Kakish, Zahi. 2021

机译：基于平均场模型的深增强学习策略，机器人群控制
6. Multiagent cooperation and competition with deep reinforcement learning [O] . Ardi Tampuu, Tambet Matiisen, Dorian Kodelja, -1

机译：多主体合作与竞争与深度强化学习
7. Message-Dropout: An Efficient Training Method for Multi-Agent Deep Reinforcement Learning [O] . Woojun Kim, Myungsik Cho, Youngchul Sung 2019

机译：消息 - 丢失：多智能经纪深度加强学习的高效培训方法
8. Hierarchical Multiagent Reinforcement Learning [R] . Ghavamzadeh, M. , Mahadevan, S. 2004

机译：分层多智能体强化学习

A Multiphase Semistatic Training Method for Swarm Confrontation Using Multiagent Deep Reinforcement Learning

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅