Train Small, Deploy Big: Do Relative World Views Permit Swarm-Safety During Policy Transplantation for Multi-Agent Reinforcement Learning Problems?


Abstract

In order to 'train small, deploy big', agent control policies must be transplanted from one trained agent into a larger set of agents for deployment. Given that compute resources and training time generally scale with the number of agents, this approach to generating swarm control policies may be favourable for larger swarms. However, in order for this process to be successful, the agent control policy must be indistinct to the agent on which it is trained so that it can perform as required in its new host agent. Through extensive simulation of a cooperative multi-agent navigation task, it is shown that this indistinctness of agent policies, and therefore the success of the associated learned solution of the transplanted swarm, is dependent upon the way in which an agent views the world: absolute or relative. As a corollary to, and in contrary to naive intuition of, this result, we show that homogeneous agent capability is not enough to guarantee policy indistinctness. The article also discusses what general conditions may be required in order to enforce policy indistinctness.
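The abstract's distinction between an absolute and a relative world view can be illustrated with a minimal sketch. The abstract does not give the paper's actual observation encoding, so everything here is an assumption made for illustration: the function names `absolute_observation` and `relative_observation`, the 2-D positions, and the choice to sort neighbours by distance are all hypothetical. The sketch only shows why an agent-centred observation does not depend on where the swarm sits in the world or on how the other agents are indexed, which is one plausible reading of a policy being "indistinct" to its host agent.

```python
from typing import List, Tuple

Vec = Tuple[float, float]


def absolute_observation(i: int, agents: List[Vec], landmarks: List[Vec]) -> List[float]:
    """World-frame view (hypothetical): own position, the other agents in
    index order, then the landmarks, all in raw world coordinates."""
    others = [p for j, p in enumerate(agents) if j != i]
    points = [agents[i], *others, *landmarks]
    return [c for p in points for c in p]


def relative_observation(i: int, agents: List[Vec], landmarks: List[Vec]) -> List[float]:
    """Agent-centred view (hypothetical): offsets from agent i to the other
    agents and to the landmarks, each group sorted by distance so that the
    result does not depend on how the other agents are numbered."""
    ox, oy = agents[i]

    def offsets(points: List[Vec]) -> List[Vec]:
        rel = [(x - ox, y - oy) for x, y in points]
        # Sort by squared distance, using the offset itself to break ties.
        return sorted(rel, key=lambda d: (d[0] ** 2 + d[1] ** 2, d))

    others = [p for j, p in enumerate(agents) if j != i]
    points = offsets(others) + offsets(landmarks)
    return [c for p in points for c in p]
```

Under these assumptions, translating the whole scene or swapping the indices of the other agents leaves `relative_observation` unchanged, while `absolute_observation` changes in both cases. The sketch says nothing about the paper's training setup or results.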


Bibliographic record

Source: Australasian Joint Conference on Artificial Intelligence | 2020 | p. 472 | 12 pages
Authors: Bradley Eraser; Giuseppe Laurito
Original format: PDF
Chinese Library Classification: TP18-53
Keywords: Multi-agent deep reinforcement learning; Policy transplantation; Cooperative navigation; Swarm-safety
Date indexed: 2022-08-21 10:48:09

Similar literature

1. Intelligent Resource Allocation for Train-to-Train Communication: A Multi-Agent Deep Reinforcement Learning Approach [J]. Zhao Junhui, Zhang Yang, Nie Yiwen. Quality Control, Transactions. 2020.
2. Learning adversarial policy in multiple scenes environment via multi-agent reinforcement learning [J]. Li Yang, Wang Xinzhi, Wang Wei. Connection Science. 2021, no. 3.
3. Energy-efficient operation by cooperative control among trains: A multi-agent reinforcement learning approach [J]. Shuai Su, Xuekai Wang, Tao Tang. Control Engineering Practice. 2021, issue Nova.
4. Train Small, Deploy Big: Do Relative World Views Permit Swarm-Safety During Policy Transplantation for Multi-Agent Reinforcement Learning Problems? [C]. Bradley Eraser, Giuseppe Laurito. Australasian Joint Conference on Artificial Intelligence. 2020.
5. Macro-Action-Based Multi-Agent Deep Reinforcement Learning in Cooperative Tasks [D]. Lu, Xingyu. 2021.
6. Multi-agent reinforcement learning with approximate model learning for competitive games [O]. Young Joon Park, Yoon Sang Cho, Seoung Bum Kim. 2012.
7. Local Policy-sharing Systems for Multi-agent Reinforcement Learning-An Approach from the Learning Classifier System [O]. Hiroyasu INOUE, Katsunori SHIMOHARA, Osamu KATAI. 2006.
