首页> 外文会议>Chinese Automation Congress >Research on Decision-making Method for Territorial Defense Based on Fuzzy Reinforcement Learnin
【24h】

Research on Decision-making Method for Territorial Defense Based on Fuzzy Reinforcement Learnin

机译:基于模糊强化学习的国土防御决策方法研究

获取原文

摘要

Protecting important targets, which also called territorial defense, will be an important application of the unmanned aerial vehicles (UAVs) in the future. This paper designs a method to generate interception strategy by learning, which can deal with invaders launched from different directions and different velocity. Firstly, we analyze the influence of initial states on the game results, and explored the initial condition boundary inside which the invader could be intercepted. Secondly, territorial defense game is a complex multi-steps decision-making problem which has continuous action and state spaces. To address this problem, conventional decision methods such as dynamic programming, moving horizon optimization method and Q-learning will cause dimension explosion issue. In this paper, we introduce a fuzzy logic into the actor-critic algorithm and reduce the amount of computation effectively. We consider invaders with different directions and speeds, can offer a more realistic result. Experiments showed that the algorithm can balance the exploration and utilization behavior well and the defender can learn to intercept the invader without prior knowledge.
机译:保护重要目标,也称为领土防御,将是未来无人机的重要应用。本文设计了一种通过学习生成拦截策略的方法,该方法可以处理从不同方向,不同速度发射的入侵者。首先,我们分析了初始状态对游戏结果的影响,并探讨了可以拦截入侵者的初始条件边界。其次,领土防御博弈是一个复杂的多步骤决策问题,具有连续的作用和状态空间。为了解决这个问题,诸如动态规划,移动视野优化方法和Q学习之类的常规决策方法将引起尺寸爆炸问题。在本文中,我们将模糊逻辑引入到参与者评判算法中,并有效地减少了计算量。我们认为入侵者具有不同的方向和速度,可以提供更真实的结果。实验表明,该算法可以很好地平衡勘探和利用行为,防御者可以在没有先验知识的情况下学会拦截入侵者。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号