Improving coordination in small-scale multi-agent deep reinforcement learning through memory-driven communication

Pesce Emanuele; Montana Giovanni

首页> 外文期刊>Machine Learning >Improving coordination in small-scale multi-agent deep reinforcement learning through memory-driven communication

【24h】

Improving coordination in small-scale multi-agent deep reinforcement learning through memory-driven communication

机译：通过内存驱动的通信提高小规模多代理深增强学习的协调

获取原文

获取原文并翻译 | 示例

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Deep reinforcement learning algorithms have recently been used to train multiple interacting agents in a centralised manner whilst keeping their execution decentralised. When the agents can only acquire partial observations and are faced with tasks requiring coordination and synchronisation skills, inter-agent communication plays an essential role. In this work, we propose a framework for multi-agent training using deep deterministic policy gradients that enables concurrent, end-to-end learning of an explicit communication protocol through a memory device. During training, the agents learn to perform read and write operations enabling them to infer a shared representation of the world. We empirically demonstrate that concurrent learning of the communication device and individual policies can improve inter-agent coordination and performance in small-scale systems. Our experimental results show that the proposed method achieves superior performance in scenarios with up to six agents. We illustrate how different communication patterns can emerge on six different tasks of increasing complexity. Furthermore, we study the effects of corrupting the communication channel, provide a visualisation of the time-varying memory content as the underlying task is being solved and validate the building blocks of the proposed memory device through ablation studies.

机译：深增强学习算法最近被用来以集中式训练多个交互代理，同时保持其执行权限。当代理商只能获取部分观察并面临需要协调和同步技能的任务时，代理商的沟通发挥着重要作用。在这项工作中，我们向多种代理训练提出了一种使用深度确定性策略梯度来提出多种代理培训，该梯度通过存储器设备通过内容，结束通信协议的并发，端到端学习。在培训期间，代理商学会执行读写操作，使他们能够推断世界的共享表示。我们经验证明了通信设备和各个策略的并发学习可以改善小规模系统中的代理商协调和性能。我们的实验结果表明，该方法在具有最多六个代理的情况下实现了卓越的性能。我们说明了如何在增加复杂性的六种不同任务上出现不同的通信模式。此外，我们研究破坏通信信道的效果，提供时变存储器内容的可视化，因为通过消融研究验证并验证所提出的存储器设备的构建块的底层任务。

著录项

来源
《Machine Learning》 |2020年第10期|1727-1747|共21页
作者
Pesce Emanuele; Montana Giovanni;
展开▼
作者单位

Univ Warwick WMG Coventry CV4 7AL W Midlands England;

Univ Warwick WMG Coventry CV4 7AL W Midlands England;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Reinforcement learning; Multi-agent systems; Artificial neural networks;

机译：加固学习;多助理系统;人工神经网络;

相似文献

外文文献
中文文献
专利

1. Network slicing for vehicular communications: a multi-agent deep reinforcement learning approach [J] . Mlika Zoubeir, Cherkaoui Soumaya Annals of telecommunications . 2021,第9a10期

机译：用于车辆通信的网络切片：多代理深度加强学习方法
2. Multi-Agent Deep Reinforcement Learning Based Spectrum Allocation for D2D Underlay Communications [J] . IEEE Transactions on Vehicular Technology . 2020,第2期

机译：基于多代理深度强化学习的D2D底层通信频谱分配
3. Learning multi-agent communication with double attentional deep reinforcement learning [J] . Hangyu Mao, Zhengchao Zhang, Zhen Xiao, Autonomous agents and multi-agent systems . 2020,第1期

机译：学习多智能经纪人沟通与双重预付深度加强学习
4. Air-Ground Coordination Communication by Multi-Agent Deep Reinforcement Learning [C] . Ruijin Ding, Feifei Gao, Guanghua Yang, IEEE International Conference on Communications . 2021

机译：多智能经纪深度加固学习空域协调沟通
5. Macro-Action-Based Multi-Agent Deep Reinforcement Learning in Cooperative Tasks [D] . Lu, Xingyu. 2021

机译：基于宏观动作的多智能经济型深度加强学习合作任务
6. On-Demand Channel Bonding in Heterogeneous WLANs: A Multi-Agent Deep Reinforcement Learning Approach [O] . Hang Qi, Hao Huang, Zhiqun Hu, 2020

机译：异构WLAN中的按需信道绑定：多代理深度强化学习方法
7. UAV-to-Device Underlay Communications: Age of Information Minimization by Multi-agent Deep Reinforcement Learning [O] . Fanyi Wu, Hongliang Zhang, Jianjun Wu, 2021

机译：无人机到设备界面通信：多智能经纪深度加强学习最小化信息的年龄

Improving coordination in small-scale multi-agent deep reinforcement learning through memory-driven communication

摘要

著录项

相似文献

相关主题

期刊订阅