首页> 外文会议>2010 IEEE International Conference on Computational Intelligence for Measurement Systems and Applications >A hybrid P2P and master-slave architecture for intelligent multi-agent reinforcement learning in a distributed computing environment: A case study
【24h】

A hybrid P2P and master-slave architecture for intelligent multi-agent reinforcement learning in a distributed computing environment: A case study

机译:分布式P2P与主从混合架构在分布式计算环境中进行智能多智能体强化学习的案例研究

获取原文

摘要

In this paper, we propose a distributed architecture for reinforcement learning in a multi-agent environment, where agents share information learned via a distributed network. Here we propose a hybrid master/slave and peer-to-peer system architecture, where a master node effectively assigns a work load (a portion of the terrain) to each node. However, this master node also manages communications between all the other system nodes, and in that sense it is a peer-to-peer architecture. It is a loosely-coupled system in that node slaves only know about the existence of the master node, and are only concerned with their work load (portion of the terrain). As part of this architecture, we show how agents are allowed to communicate with other agents in the same or different nodes and share information that pertains to all agents, including the agent obstacle barriers. In particular, one main contribution of the paper is multi-agent reenforcement learning in a distributed system, where the agents do not have complete knowledge and information of their environment, other than what is available on the computing node, the particular agent (s) is (are) running on. We show how agents, running on same or different nodes, coordinate the sharing of their respective environment states/information to collaboratively perform their respective tasks.
机译:在本文中,我们提出了一种分布式架构,用于在多代理环境中进行强化学习,在该环境中,代理共享通过分布式网络学习的信息。在这里,我们提出了一种混合的主/从和对等系统体系结构,其中主节点有效地将工作负载(地形的一部分)分配给每个节点。但是,此主节点还管理所有其他系统节点之间的通信,从这个意义上说,它是对等体系结构。它是一个松散耦合的系统,其中节点从属节点仅知道主节点的存在,并且仅关注其工作负载(地形的一部分)。作为此体系结构的一部分,我们将说明如何允许座席与同一节点或不同节点中的其他座席进行通信,并共享与所有座席有关的信息,包括座席障碍壁垒。特别是,本文的一个主要贡献是分布式系统中的多智能体强化学习,其中的智能体除了对计算节点上可用的东西(特定的智能体)不了解之外,还没有关于其环境的完整知识和信息。正在运行。我们展示了在相同或不同节点上运行的代理如何协调其各自环境状态/信息的共享以协作执行其各自的任务。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号