EPIA Conference on Artificial Intelligence

Multi-agent Double Deep Q-Networks

Abstract

There are many open issues and challenges in the field of multi-agent reward-based learning. Theoretical convergence guarantees are lost, and the complexity of the action space grows exponentially with the number of agents computing their optimal joint action. Function approximators such as deep neural networks have been used successfully in single-agent environments with high-dimensional state spaces. We propose the Multi-agent Double Deep Q-Networks algorithm, an extension of Deep Q-Networks to the multi-agent paradigm. Two common techniques of multi-agent Q-learning are used to formally describe our proposal, which is tested in a Foraging Task and a Pursuit Game. Owing to the strength of deep-learning techniques, we also demonstrate how the learned policies generalize to similar tasks and to larger teams, and how viable they are for transfer learning approaches. With only a small fraction of the initial task's training, we adapt to longer tasks, and we accelerate task completion by increasing the team size, thus empirically demonstrating a solution to the complexity issues of the multi-agent field.
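The abstract does not spell out the update rule. As a minimal sketch of the underlying idea, the snippet below shows the standard Double DQN target, in which the online network selects the greedy next action and the target network evaluates it, applied per agent under an independent-learners view where teammates are treated as part of the environment. The QNet architecture, the dimensions, and the independent-learners framing are illustrative assumptions, not the paper's actual design.

import torch
import torch.nn as nn

class QNet(nn.Module):
    """Hypothetical per-agent Q-network; the paper's architecture is not given here."""
    def __init__(self, obs_dim: int, n_actions: int, hidden: int = 64):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(obs_dim, hidden), nn.ReLU(),
            nn.Linear(hidden, n_actions),
        )

    def forward(self, obs: torch.Tensor) -> torch.Tensor:
        return self.net(obs)

def double_dqn_targets(online: QNet, target: QNet, rewards: torch.Tensor,
                       next_obs: torch.Tensor, dones: torch.Tensor,
                       gamma: float = 0.99) -> torch.Tensor:
    """Double DQN target: the online net picks the greedy action,
    the target net evaluates it, which reduces Q-value overestimation."""
    with torch.no_grad():
        greedy = online(next_obs).argmax(dim=1, keepdim=True)   # action selection
        q_next = target(next_obs).gather(1, greedy).squeeze(1)  # action evaluation
        return rewards + gamma * (1.0 - dones) * q_next

# Independent-learners view: each agent keeps its own online/target pair
# and treats its teammates as part of the environment dynamics.
agents = [(QNet(obs_dim=8, n_actions=4), QNet(obs_dim=8, n_actions=4)) for _ in range(3)]
for online, target in agents:
    target.load_state_dict(online.state_dict())  # periodically re-synced during training

Decoupling action selection (online network) from action evaluation (target network) is what distinguishes Double DQN from vanilla DQN; per agent, the resulting target would feed a standard temporal-difference loss against the online network's Q-value for the action actually taken.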
