Collaborative multi-agent reinforcement learning based on a novel coordination tree frame with dynamic partition

Min Fang; Frans C.A. Groen; Hao Li; Jujie Zhang

首页> 外文期刊>Engineering Applications of Artificial Intelligence >Collaborative multi-agent reinforcement learning based on a novel coordination tree frame with dynamic partition

【24h】

Collaborative multi-agent reinforcement learning based on a novel coordination tree frame with dynamic partition

机译：基于具有动态分区的新型协调树框架的协同多主体强化学习

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

In the research of team Markov games, computing the coordinate team dynamically and determining the joint action policy are the main problems. To deal with the first problem, a dynamic team partitioning method is proposed based on a novel coordinate tree frame. We build a coordinate tree with coordinate agent subset and define two breaching weights to represent the weights of an agent to corporate with the agent subset. Each agent chooses the agent subset with a minimum cost as the coordinate team based on coordinate tree. The Q.-learning based on belief allocation studies multi-agents joint action policy which helps corporative multi-agents joint action policy to converge to the optimum solution. We perform experiments on multiple simulation environments and compare the proposed algorithm with similar ones. Experimental results show that the proposed algorithms are able to dynamically compute the corporative teams and design the optimum joint action policy for corporative teams.

机译：在团队马尔可夫博弈研究中，动态计算坐标团队并确定联合行动策略是主要问题。针对第一个问题，提出了一种基于新颖坐标树框架的动态团队划分方法。我们用座席代理子集构建一个坐标树，并定义两个违规权重以代表座席对拥有座席子集的公司的权重。每个座席根据坐标树选择成本最低的座席子集作为座席团队。基于信念分配的Q学习研究了多主体联合行动策略，该策略有助于企业多主体联合行动策略收敛到最优解。我们在多个仿真环境上进行实验，并将所提出的算法与相似的算法进行比较。实验结果表明，所提出的算法能够动态地计算出企业团队，并为企业团队设计了最优的联合行动策略。

著录项

来源
《Engineering Applications of Artificial Intelligence》 |2014年第1期|191-198|共8页
作者
Min Fang; Frans C.A. Groen; Hao Li; Jujie Zhang;
展开▼
作者单位

School of Computer Science and Technology, Xidian University, China;

Informatics Institute, University of Amsterdam, The Netherlands;

School of Computer Science and Technology, Xidian University, China;

School of Computer Science and Technology, Xidian University, China;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Multi-agent; Coordination tree; Markov games; Belief propagation; Q learning;

机译：多主体协调树;马尔可夫游戏;信仰传播;Q学习;

相似文献

外文文献
中文文献
专利

1. Multi-Agent Reinforcement Learning Based Distributed Transmission in Collaborative Cloud-Edge Systems [J] . Xu Chunmei, Liu Shengheng, Zhang Cheng, IEEE Transactions on Vehicular Technology . 2021,第2期

机译：基于多功能加强学习的协同云系统中的分布式传输
2. Collaborative multi-agent reinforcement learning based on experience propagation [J] . Fang, Min, Groen, Tecnologias del Aprendizaje, IEEE Revista Iberoamericana de . 2013,第4期

机译：基于经验传播的协同多主体强化学习
3. Collaborative multi-agent reinforcement learning based on experience propagation [J] . Min Fang, Frans C.A.Groen 系统工程与电子技术（英文版） . 2013,第004期

机译：基于经验传播的协作式多主体强化学习
4. Dynamic Partition of Collaborative Multiagent Based on Coordination Trees [C] . Fang Min, Frans C. A. Groen, Li Hao International Conference on Intelligent Autonomous Systems . 2013

机译：基于协调树的协同多算法动态分区
5. A study of collaborative distributed intelligent multi-agent reinforcement learning via multi goals for dynamic agent shortest path-planning [D] . Kim, Minsuk. 2016

机译：通过多目标进行动态代理最短路径规划的协同分布式智能多功能智能多功能多智能智能多功能
6. TREE-BASED REINFORCEMENT LEARNING FOR ESTIMATING OPTIMAL DYNAMIC TREATMENT REGIMES [O] . Yebin Tao, Lu Wang, Daniel Almirall -1

机译：基于树的加固学习用于估计最佳动态处理方案
7. Zero-Trust Based Distributed Collaborative Dynamic Access Control Scheme with Deep Multi-Agent Reinforcement Learning [O] . Qiuqing Jin, Liming Wang 2021

机译：基于零信任的分布式协作动态访问控制方案，具有深层多功能增强学习

Collaborative multi-agent reinforcement learning based on a novel coordination tree frame with dynamic partition

摘要

著录项

相似文献

相关主题

期刊订阅