2011 19th Mediterranean Conference on Control & Automation (MED)

Decentralized learning in multiple pursuer-evader Markov games



Abstract

We represent the multiple pursuer-evader game as a Markov game, with each player modeled as a decentralized unit that must act independently to complete its task. Most proposed solutions to this distributed multi-agent decision problem require some form of central coordination. In this paper, we model each player as a learning automaton (LA) and let the players evolve and adapt to solve the difficult problem at hand. We also show that, under the proposed learning process, the players' policies converge to an equilibrium point. Simulations of scenarios with multiple pursuers and evaders are presented to demonstrate the feasibility of the approach.
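The abstract does not specify the LA update rule, but a common choice for this kind of decentralized learner is the linear reward-inaction (L_R-I) scheme, in which each player keeps a probability vector over its actions and reinforces the chosen action only when it receives a positive payoff. The sketch below illustrates one such automaton in Python; the class name, learning rate, and binary reward signal are illustrative assumptions, not details taken from the paper.

```python
import random

class LearningAutomaton:
    """Minimal sketch of a linear reward-inaction (L_R-I) learning automaton.

    Each decentralized player would own one automaton over its own action set.
    The L_R-I rule and the learning rate are assumptions for illustration; the
    paper's exact scheme is not given in the abstract.
    """

    def __init__(self, n_actions, learning_rate=0.05):
        self.n_actions = n_actions
        self.learning_rate = learning_rate
        # Start from a uniform policy over the available actions.
        self.probs = [1.0 / n_actions] * n_actions

    def choose_action(self):
        # Sample an action according to the current action-probability vector.
        r, cum = random.random(), 0.0
        for action, p in enumerate(self.probs):
            cum += p
            if r <= cum:
                return action
        return self.n_actions - 1

    def update(self, action, reward):
        # Reward-inaction: reinforce the chosen action only on a positive payoff;
        # on zero payoff the probability vector is left unchanged.
        if reward > 0:
            for a in range(self.n_actions):
                if a == action:
                    self.probs[a] += self.learning_rate * (1.0 - self.probs[a])
                else:
                    self.probs[a] -= self.learning_rate * self.probs[a]
```

In a pursuit-evasion simulation, each pursuer and evader would hold one such automaton, sample an action at every step of the Markov game, and call update with the payoff observed for that step; under reward-inaction updates of this kind, the decentralized policies can settle toward an equilibrium of the game, which is the flavor of convergence the paper claims for its own learning process.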

