首页> 外文期刊>Proceedings of the Institution of Mechanical Engineers >An algorithm of pretrained fuzzy actor-critic learning applying in fixed-time space differential game
【24h】

An algorithm of pretrained fuzzy actor-critic learning applying in fixed-time space differential game

机译:固定时间空间差异游戏申请普里雷普雷斯模糊演员 - 评论家算法

获取原文
获取原文并翻译 | 示例
       

摘要

Solving space differential game in an unknown environment remains a challenging problem. This article proposes a pretrained fuzzy actor-critic learning algorithm for dealing with the space pursuit-evasion game in fixed time. It is supposed that the research objects are two agents including one pursuer and one evader in space. A virtual environment, which is defined as the known part of the real environment, is utilized for deriving optimal strategies of the pursuer and the evader, respectively. Through employing the fuzzy inference system, a pretrained process, which is based on the genetic algorithm, is designed to obtain the initial consequent set of the pursuer and the evader. Besides, an actor-critic framework is applied to finely learn the suitable consequent set of the pursuer and evader in the real environment. Numerical experimental results validate the effectiveness of the proposed algorithms on improving the ability of the agents to adapt to the real environment.
机译:在未知环境中解决空间差异游戏仍然是一个具有挑战性的问题。 本文提出了一种普瑞烈的模糊演员 - 评论家,用于在固定时间处理空间追求逃避游戏的学习算法。 值得注意的是,研究对象是两个代理商,包括一个追捕者和一个空间中的一个避难所。 被定义为真实环境的已知部分的虚拟环境,用于分别导出追捕者和避难者的最佳策略。 通过采用模糊推理系统,借鉴基于遗传算法的预磨削过程旨在获得追踪和避难者的初始改性。 此外,演员 - 评论家框架应用于在真实环境中精细地学习合适的追求和避难所的合适的后果。 数值实验结果验证了提出算法的有效性,以提高代理能力适应真实环境的能力。

著录项

相似文献

  • 外文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号