An algorithm of pretrained fuzzy actor-critic learning applying in fixed-time space differential game

Wang Xiao; Shi Peng; Schwartz Howard; Zhao Yushan

首页> 外文期刊>Proceedings of the Institution of Mechanical Engineers >An algorithm of pretrained fuzzy actor-critic learning applying in fixed-time space differential game

【24h】

An algorithm of pretrained fuzzy actor-critic learning applying in fixed-time space differential game

机译：固定时间空间差异游戏申请普里雷普雷斯模糊演员 - 评论家算法

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Solving space differential game in an unknown environment remains a challenging problem. This article proposes a pretrained fuzzy actor-critic learning algorithm for dealing with the space pursuit-evasion game in fixed time. It is supposed that the research objects are two agents including one pursuer and one evader in space. A virtual environment, which is defined as the known part of the real environment, is utilized for deriving optimal strategies of the pursuer and the evader, respectively. Through employing the fuzzy inference system, a pretrained process, which is based on the genetic algorithm, is designed to obtain the initial consequent set of the pursuer and the evader. Besides, an actor-critic framework is applied to finely learn the suitable consequent set of the pursuer and evader in the real environment. Numerical experimental results validate the effectiveness of the proposed algorithms on improving the ability of the agents to adapt to the real environment.

机译：在未知环境中解决空间差异游戏仍然是一个具有挑战性的问题。本文提出了一种普瑞烈的模糊演员 - 评论家，用于在固定时间处理空间追求逃避游戏的学习算法。值得注意的是，研究对象是两个代理商，包括一个追捕者和一个空间中的一个避难所。被定义为真实环境的已知部分的虚拟环境，用于分别导出追捕者和避难者的最佳策略。通过采用模糊推理系统，借鉴基于遗传算法的预磨削过程旨在获得追踪和避难者的初始改性。此外，演员 - 评论家框架应用于在真实环境中精细地学习合适的追求和避难所的合适的后果。数值实验结果验证了提出算法的有效性，以提高代理能力适应真实环境的能力。

著录项

来源
《Proceedings of the Institution of Mechanical Engineers》 |2021年第14期|2095-2112|共18页
作者
Wang Xiao; Shi Peng; Schwartz Howard; Zhao Yushan;
展开▼
作者单位

Beihang Univ Sch Astronaut 37 Xueyuan Rd Beijing 100191 Peoples R China;

Beihang Univ Sch Astronaut 37 Xueyuan Rd Beijing 100191 Peoples R China;

Carleton Univ Dept Syst & Comp Engn Ottawa ON Canada;

Beihang Univ Sch Astronaut 37 Xueyuan Rd Beijing 100191 Peoples R China;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Differential game; reinforcement learning; actor-critic; fuzzy system;

机译：差动游戏;钢筋学习;演员 - 评论家;模糊系统;
入库时间 2022-08-19 03:08:41

相似文献

外文文献
专利

1. An actor-Critic algorithm for multi-agent learning in queue-based stochastic games [J] . D. Krishna Sundar, K. Ravikumar Neurocomputing . 2014,第mara15期

机译：基于队列的随机博弈中多主体学习的actor-Critic算法
2. A Decentralized Fuzzy Learning Algorithm for Pursuit-Evasion Differential Games with Superior Evaders [J] . Awheda Mostafa D., Schwartz Howard M. Journal of Intelligent & Robotic Systems: Theory & Application . 2016,第1期

机译：具有高级避让者的追逃性微分游戏的分散模糊学习算法
3. Applying game mechanics and student-generated questions to an online puzzle-based game learning system to promote algorithmic thinking skills [J] . Hsu Chih-Chao, Wang Tzone-I. Computers & education . 2018,第JUNa期

机译：将游戏机制和学生生成的问题应用于基于在线拼图的游戏学习系统，以提高算法思维能力
4. Fuzzy actor-critic learning automaton algorithm for the pursuit-evasion differential game [C] . Ahmad A. Al-Talabi International Automatic Control Conference . 2017

机译：追逃微分游戏的模糊行为者学习自动机算法
5. Learning in Pursuit-Evasion Differential Games Using Reinforcement Fuzzy Learning. [D] . Al Faiya, Badr. 2012

机译：使用强化模糊学习在追逃性差分游戏中学习。
6. A Pre-Trained Fuzzy Reinforcement Learning Method for the Pursuing Satellite in a One-to-One Game in Space [O] . Xiao Wang, Peng Shi, Yushan Zhao, 2020

机译：在太空一对一游戏中追踪卫星的预训练模糊强化学习方法
7. Actor-Critic Algorithms for Learning Nash Equilibria in N-player General-Sum Games [O] . Prasad, H. L, Prashanth, L. A., Bhatnagar, Shalabh 2015

机译：N-player中学习纳什均衡的演员批评算法一般和游戏

An algorithm of pretrained fuzzy actor-critic learning applying in fixed-time space differential game

摘要

著录项

相似文献

相关主题

期刊订阅