Integrating Motivated Learning and k-Winner-Take-All to Coordinate Multi-agent Reinforcement Learning

Abstract

This work addresses the coordination issue in distributed optimization problems (DOPs), where multiple distinct and time-critical tasks are performed to satisfy a global objective function. The performance of these tasks has to be coordinated because they share consumable resources and depend on non-consumable resources. Because predefining the performance of the tasks can be sub-optimal for large DOPs, the multi-agent reinforcement learning (MARL) framework is adopted, wherein an agent learns the performance of each distinct task using reinforcement learning. To coordinate MARL, we propose a novel coordination strategy integrating Motivated Learning (ML) and the k-Winner-Take-All (k-WTA) approach. The agents' priority for the shared resources is determined in real time using Motivated Learning. Because the amount of the shared resources is finite, the k-WTA approach is used to allow the maximum number of the most urgent tasks to execute. Agents performing tasks that depend on resources produced by other agents are coordinated using domain knowledge. Compared with existing approaches, our proposed approach is shown to be the most effective in coordinating multi-agent reinforcement learning in experiments based on a 16-task DOP and a 68-task DOP.
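
As a rough illustration of the k-WTA step described in the abstract, the Python sketch below admits tasks in descending order of urgency until the shared resource runs out. It is not the authors' implementation: the abstract does not state the selection rule, so the `Task` fields, the task names, the urgency values (standing in for priorities that Motivated Learning would supply), the per-task `demand`, and the stop-at-first-misfit rule are all assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Task:
    name: str       # hypothetical task label
    urgency: float  # assumed priority signal, e.g. supplied by Motivated Learning
    demand: int     # assumed units of the shared resource the task consumes


def k_wta_select(tasks, capacity):
    """Return the k most urgent tasks, where k is the length of the longest
    urgency-ordered prefix whose total demand fits within capacity."""
    ranked = sorted(tasks, key=lambda t: t.urgency, reverse=True)
    winners = []
    remaining = capacity
    for task in ranked:
        if task.demand > remaining:
            break  # stop at the first misfit so the winners stay a strict top-k
        winners.append(task)
        remaining -= task.demand
    return winners


tasks = [
    Task("refuel", urgency=0.9, demand=2),
    Task("patrol", urgency=0.4, demand=1),
    Task("repair", urgency=0.7, demand=2),
    Task("scout", urgency=0.2, demand=1),
]
winners = k_wta_select(tasks, capacity=5)
print([t.name for t in winners])  # ['refuel', 'repair', 'patrol']
```

Stopping at the first task that does not fit keeps the winner set a pure top-k by urgency; a variant that skips over a misfit to admit smaller, less urgent tasks would use the resource more fully but would no longer be a strict k-winner selection.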

Bibliographic record

Source
The 2014 IEEE/WIC/ACM International Conference on Intelligent Agent Technology | 2014 | pp. 190-197 | 8 pages
Conference location: Warsaw (PL)
Authors
Teck-Hou Teng; Ah-Hwee Tan; Starzyk J.A.; Yuan-Sin Tan; Loo-Nin Teow
Original format: PDF
Text language: English
Keywords
learning (artificial intelligence); multi-agent systems; optimisation; resource allocation; 16-task DOP; 68-task DOP; MARL framework; consumable resource sharing; distributed optimization problem; global objective function; k-WTA approach; k-winner-take-all approach; motivated learning; multiagent reinforcement learning framework; nonconsumable resource dependency; time-critical tasks; Educational institutions; Games; Learning (artificial intelligence); Linear programming; Optimization; Pain; Resource management; Moti;

Similar literature

1. A unified framework for reinforcement learning, co-learning and meta-learning how to coordinate in collaborative multi-agent systems [J]. Predrag T. Tošić, Ricardo Vilalta. Procedia Computer Science. 2010, no. 1
2. CCNet: Cluster-Coordinated Net for Learning Multi-agent Communication Protocols with Reinforcement Learning [J]. Xin Wen, Zheng-Jun Zha, Zilei Wang. JMLR: Workshop and Conference Proceedings. 2018, no. 2010
3. Coordinated control of gas supply system in PEMFC based on multi-agent deep reinforcement learning [J]. Li Jiawen, Yu Tao, Yang Bo. International Journal of Hydrogen Energy. 2021, no. 68
4. Integrating Motivated Learning and k-Winner-Take-All to Coordinate Multi-agent Reinforcement Learning [C]. Teck-Hou Teng, Ah-Hwee Tan, Starzyk J.A. IEEE/WIC/ACM International Conferences on Intelligent Agent Technologies. 2014
5. Decentralized Coordinated Optimal Ramp Metering using Multi-agent Reinforcement Learning [D]. Rezaee, Kasra. 2014
6. Multi-agent reinforcement learning with approximate model learning for competitive games [O]. Young Joon Park, Yoon Sang Cho, Seoung Bum Kim. 2012
7. A unified framework for reinforcement learning, co-learning and meta-learning how to coordinate in collaborative multi-agent systems [O]. Tošić Predrag T., Vilalta Ricardo. 2010
8. Intrinsically Motivated Reinforcement Learning: A Promising Framework for Developmental Robot Learning [R]. Stout, A., Konidaris, G. D., Barto, A. G. 2005
