International Conference on Autonomous Agents and Multiagent Systems

Training Cooperative Agents for Multi-Agent Reinforcement Learning: Extended Abstract


Abstract

Deep learning and backpropagation have been used successfully to perform centralized training with communication protocols among multiple agents in a cooperative environment. In this paper we present techniques for centralized training of Multi-Agent (Deep) Reinforcement Learning (MARL), using the model-free Deep Q-Network as the baseline model and message sharing between agents. We present a novel, scalable, centralized MARL training technique that separates the message-learning module from the policy module. Separating these modules speeds convergence in complex domains such as autonomous driving simulators. A second contribution uses the centrally trained model to bootstrap the training of distributed, independent, cooperative agent policies for execution, thereby addressing the challenges of noise and communication bottlenecks in real-time communication channels. This paper compares our centralized training algorithms, theoretically and empirically, with current research in the field of MARL. We also present and release a new OpenAI-Gym environment, which can be used for multi-agent research, as it simulates multiple autonomous cars driving cooperatively on a highway.
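The separation of a message-learning module from a policy module can be illustrated with a minimal, library-free sketch. All names here (`MessageModule`, `PolicyModule`, `joint_step`) and the dimensions are illustrative assumptions, not the paper's actual architecture, and the training/backpropagation step is omitted; the sketch only shows the forward data flow of centralized execution with message sharing.

```python
import numpy as np

rng = np.random.default_rng(0)

class MessageModule:
    """Encodes an agent's observation into a fixed-size message
    vector broadcast to its teammates (hypothetical component)."""
    def __init__(self, obs_dim, msg_dim):
        self.W = rng.normal(scale=0.1, size=(msg_dim, obs_dim))

    def encode(self, obs):
        return np.tanh(self.W @ obs)

class PolicyModule:
    """A DQN-style Q-function over the agent's own observation
    concatenated with the messages received from other agents."""
    def __init__(self, in_dim, n_actions):
        self.W = rng.normal(scale=0.1, size=(n_actions, in_dim))

    def act(self, x):
        # Greedy action over the (linear, untrained) Q-values.
        return int(np.argmax(self.W @ x))

def joint_step(observations, msg_mods, pol_mods):
    """Centralized step: every agent first broadcasts a message,
    then each policy acts on its observation plus the others'
    messages. Because the two modules are separate objects, the
    message encoder can be trained (or frozen) independently of
    the policy network."""
    messages = [m.encode(o) for m, o in zip(msg_mods, observations)]
    actions = []
    for i, (pol, obs) in enumerate(zip(pol_mods, observations)):
        others = [msg for j, msg in enumerate(messages) if j != i]
        actions.append(pol.act(np.concatenate([obs] + others)))
    return actions

# Two agents, 4-dim observations, 3-dim messages, 5 actions each.
OBS, MSG, ACT, N = 4, 3, 5, 2
msg_mods = [MessageModule(OBS, MSG) for _ in range(N)]
pol_mods = [PolicyModule(OBS + (N - 1) * MSG, ACT) for _ in range(N)]
obs = [rng.normal(size=OBS) for _ in range(N)]
actions = joint_step(obs, msg_mods, pol_mods)
print(actions)
```

In a full implementation, the bootstrapping contribution described above would correspond to training `MessageModule` and `PolicyModule` jointly under a central critic, then distilling independent per-agent policies that no longer rely on a reliable communication channel.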
