This paper proposes a reward-function formulation for the fuzzy actor-critic learning automaton (FACLA) algorithm that trains a team of pursuers to capture a single evader. All pursuers and the evader are assumed to move at similar speeds. With the proposed reward function, the FACLA algorithm can be applied in a decentralized manner: each pursuer learns to take appropriate actions by tuning the parameters of its fuzzy logic controller (FLC) with the FACLA algorithm. The proposed reward function enables each pursuer to update its value function accurately. It depends on two factors that teach each pursuer how to participate in capturing the evader. The first is the change in the line-of-sight (LOS) angle between each pursuer and the evader over two consecutive time instants. The second is the change in the Euclidean distance between each pursuer and the evader over two consecutive time instants. Simulation results are given to validate the FACLA algorithm with the proposed reward function.
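As a rough illustration of the two-factor reward described above, the following Python sketch combines the change in the LOS angle and the change in the Euclidean distance between a pursuer and the evader at two consecutive time instants. The weights `w_los` and `w_dist`, the sign conventions, and the function names are assumptions for illustration, not the paper's exact formulation.

```python
import math

def los_angle(pursuer, evader):
    """Line-of-sight angle from the pursuer to the evader (radians)."""
    return math.atan2(evader[1] - pursuer[1], evader[0] - pursuer[0])

def distance(pursuer, evader):
    """Euclidean distance between the pursuer and the evader."""
    return math.hypot(evader[0] - pursuer[0], evader[1] - pursuer[1])

def reward(p_prev, p_curr, e_prev, e_curr, w_los=1.0, w_dist=1.0):
    """Hypothetical two-factor reward over two consecutive time instants:
    - first factor: change in the LOS-angle magnitude between pursuer and evader,
    - second factor: change in the pursuer-evader Euclidean distance.
    A negative change in either factor (pursuer aligning with the LOS and
    closing the distance) yields a positive reward under this sign convention."""
    d_los = abs(los_angle(p_curr, e_curr)) - abs(los_angle(p_prev, e_prev))
    d_dist = distance(p_curr, e_curr) - distance(p_prev, e_prev)
    return -w_los * d_los - w_dist * d_dist
```

For example, a pursuer moving from (0, 0) to (1, 0) toward a stationary evader at (5, 0) reduces the distance, so the reward is positive; moving away yields a negative reward.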