Effective Communications: A Joint Learning and Communication Framework for Multi-Agent Reinforcement Learning Over Noisy Channels

Tung Tze-Yang; Kobus Szymon; Roig Joan Pujol; Gunduz Deniz


Abstract

We propose a novel formulation of the "effectiveness problem" in communications, put forth by Shannon and Weaver in their seminal work "The Mathematical Theory of Communication", by considering multiple agents communicating over a noisy channel in order to achieve better coordination and cooperation in a multi-agent reinforcement learning (MARL) framework. Specifically, we consider a multi-agent partially observable Markov decision process (MA-POMDP), in which the agents, in addition to interacting with the environment, can also communicate with each other over a noisy communication channel. The noisy communication channel is considered explicitly as part of the dynamics of the environment, and the message each agent sends is part of the action that the agent can take. As a result, the agents learn not only to collaborate with each other but also to communicate "effectively" over a noisy channel. This framework generalizes both the traditional communication problem, where the main goal is to convey a message reliably over a noisy channel, and the "learning to communicate" framework that has received recent attention in the MARL literature, where the underlying communication channels are assumed to be error-free. We show via examples that the joint policy learned using the proposed framework is superior to that where the communication is considered separately from the underlying MA-POMDP. This is a very powerful framework, which has many real world applications, from autonomous vehicle planning to drone swarm control, and opens up the rich toolbox of deep reinforcement learning for the design of multi-user communication systems.
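As an illustration of the idea that the channel is part of the environment dynamics and that each agent's message is part of its action, here is a minimal Python sketch. It is not the authors' implementation: the two-agent setup, the binary symmetric channel, the matching-action reward, and the names `NoisyCommEnv` and `binary_symmetric_channel` are all assumptions made for this example, since the abstract only says that the channel is noisy.

```python
import random


def binary_symmetric_channel(bits, flip_prob, rng):
    """Flip each bit independently with probability flip_prob (assumed channel model)."""
    return [b ^ 1 if rng.random() < flip_prob else b for b in bits]


class NoisyCommEnv:
    """Hypothetical two-agent environment whose step() includes the noisy channel."""

    def __init__(self, flip_prob=0.1, seed=0):
        self.flip_prob = flip_prob
        self.rng = random.Random(seed)
        # Last (possibly corrupted) message received by each agent.
        self.inbox = {0: [0], 1: [0]}

    def step(self, joint_action):
        """joint_action maps agent id -> (env_action, message_bits)."""
        # The message is part of the action: it is sent through the channel
        # to the other agent as part of the environment transition.
        for sender, (_, message) in joint_action.items():
            receiver = 1 - sender
            self.inbox[receiver] = binary_symmetric_channel(
                message, self.flip_prob, self.rng
            )
        # Placeholder cooperative reward: 1 if both agents pick the same action.
        actions = [joint_action[i][0] for i in (0, 1)]
        reward = 1.0 if actions[0] == actions[1] else 0.0
        # Partial observability: each agent sees only what it received.
        observations = {i: tuple(self.inbox[i]) for i in (0, 1)}
        return observations, reward


env = NoisyCommEnv(flip_prob=0.1, seed=0)
observations, reward = env.step({0: (1, [1, 0, 1]), 1: (1, [0, 0])})
```

Because the corruption happens inside `step`, a policy trained on this environment has to choose messages that remain useful after bit flips, which is the sense in which communication and control are learned jointly.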


Bibliographic details

Source
IEEE Journal on Selected Areas in Communications | 2021, Issue 8 | pp. 2590-2603 | 14 pages
Authors
Tung Tze-Yang; Kobus Szymon; Roig Joan Pujol; Gunduz Deniz
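Author affiliations

Imperial College London, Department of Electrical and Electronic Engineering, Information Processing and Communications Lab (IPC Lab), London SW7 2AZ, England;

Imperial College London, Department of Electrical and Electronic Engineering, Information Processing and Communications Lab (IPC Lab), London SW7 2AZ, England;

Samsung Electronics Research and Development Institute UK, Staines-upon-Thames TW18 4QE, England;

Imperial College London, Department of Electrical and Electronic Engineering, Information Processing and Communications Lab (IPC Lab), London SW7 2AZ, England;

Indexing information
Original format: PDF
Language: English
Keywords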
Noise measurement; Protocols; Channel coding; Semantics; Reinforcement learning; Modulation; Wireless communication; Learning to communicate; reinforcement learning (RL); multi-agent systems; joint source-channel coding; error correction coding;


