Multi-agent Double Deep Q-Networks

Abstract

There are many open issues and challenges in the field of multi-agent reward-based learning. Theoretical convergence guarantees are lost, and the complexity of the joint action space grows exponentially with the number of agents computing their optimal joint action. Function approximators, such as deep neural networks, have been used successfully in single-agent environments with high-dimensional state spaces. We propose the Multi-agent Double Deep Q-Networks algorithm, an extension of Deep Q-Networks to the multi-agent paradigm. Two common multi-agent Q-learning techniques are used to formally describe our proposal, which is tested on a Foraging Task and a Pursuit Game. We also demonstrate how the learned policies can generalize to similar tasks and to larger teams, owing to the strength of deep-learning techniques and their suitability for transfer learning. With only a small fraction of the initial task's training, we adapt to longer tasks, and we accelerate task completion by increasing the team size, thus empirically demonstrating a solution to the complexity issues of the multi-agent field.
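The abstract does not spell out the update rule, so the following is only a point of reference: a minimal sketch of the standard Double DQN target computation that such a multi-agent extension builds on, applied per agent as in independent multi-agent Q-learning. The class names, network sizes, and tensor shapes below are illustrative assumptions, not the authors' formulation.

# Hypothetical sketch: per-agent Double DQN target computation (not the paper's exact method).
import torch
import torch.nn as nn

class QNet(nn.Module):
    """Small Q-network mapping an agent's observation to action values."""
    def __init__(self, obs_dim, n_actions):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(obs_dim, 64), nn.ReLU(),
            nn.Linear(64, n_actions),
        )
    def forward(self, obs):
        return self.net(obs)

def double_dqn_targets(online, target, obs_next, rewards, dones, gamma=0.99):
    # Double DQN: select the next action with the online network,
    # but evaluate it with the target network to reduce overestimation bias.
    with torch.no_grad():
        next_actions = online(obs_next).argmax(dim=1, keepdim=True)
        next_q = target(obs_next).gather(1, next_actions).squeeze(1)
        return rewards + gamma * (1.0 - dones) * next_q

# Example usage: each agent holds its own online/target pair (independent learners);
# sharing parameters across the team is one way such a method could transfer to larger teams.
obs_dim, n_actions = 8, 5
online, target = QNet(obs_dim, n_actions), QNet(obs_dim, n_actions)
target.load_state_dict(online.state_dict())

batch = 32
obs_next = torch.randn(batch, obs_dim)
rewards = torch.randn(batch)
dones = torch.zeros(batch)
y = double_dqn_targets(online, target, obs_next, rewards, dones)
print(y.shape)  # torch.Size([32])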
