IEEE Conference on Decision and Control

A Distributed Algorithm for Solving a Class of Multi-agent Markov Decision Problems



Abstract

This paper considers a class of infinite-horizon Markov decision processes (MDPs) with multiple decision makers, called agents, and a general joint reward structure, but a special decomposable state/action structure such that each individual agent's actions affect the system's state transitions independently of the actions of all other agents. We introduce the concept of "localization," whereby each agent need only consider a "local" MDP defined on its own state and action spaces. Based on this localization concept, we propose an iterative distributed algorithm that emulates gradient ascent and converges to a locally optimal solution for the average-reward case. The solution is an "autonomous" joint policy in which each agent's action depends only on its local state.
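The abstract does not give formulas, but a minimal sketch of what the decomposable structure and the autonomous policy amount to, assuming a joint state s = (s_1, ..., s_n) and joint action a = (a_1, ..., a_n) over n agents with per-agent kernels P_i and policies \pi_i (notation introduced here for illustration), is:

\[
P(s' \mid s, a) \;=\; \prod_{i=1}^{n} P_i(s_i' \mid s_i, a_i),
\qquad
\pi(a \mid s) \;=\; \prod_{i=1}^{n} \pi_i(a_i \mid s_i).
\]

Under such a factorization, each agent i can treat (s_i, a_i, P_i) as its own local MDP and adjust its local policy \pi_i via gradient-ascent-like updates on the shared average reward, using only its locally observed state.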

