首页> 外文期刊>aerospace >Integrated Guidance-and-Control Design for Three-Dimensional Interception Based on Deep-Reinforcement Learning
【24h】

Integrated Guidance-and-Control Design for Three-Dimensional Interception Based on Deep-Reinforcement Learning

机译:基于深度强化学习的三维拦截一体化制控设计

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

This study applies deep-reinforcement-learning algorithms to integrated guidance and control for three-dimensional, high-maneuverability missile-target interception. Dynamic environment, reward functions concerning multi-factors, agents based on the deep-deterministic-policy-gradient algorithm, and action signals with pitch and yaw fins as control commands were constructed in the research, which control the missile in order to intercept targets. Firstly, the missile-interception system includes dynamics such as the inertia of the missile, the aerodynamic parameters, and fin delays. Secondly, to improve the convergence speed and guidance accuracy, a convergence factor for the angular velocity of the target line of sight and deep dual-filter methods were introduced into the design of the reward function. The method proposed in this paper was then compared with traditional proportional navigation. Next, many simulations were carried out on high-maneuverability targets with different initial conditions by randomization. The numerical-simulation results showed that the proposed guidance strategy has higher guidance accuracy and stronger robustness and generalization capability against the aerodynamic parameters.
机译:本研究将深度强化学习算法应用于三维、高机动性导弹目标拦截的集成制导与控制。本文构建了动态环境、多因素奖励函数、基于深度确定性策略梯度算法的智能体、以俯仰和偏航鳍为控制指令的动作信号,控制导弹拦截目标。首先,导弹拦截系统包括导弹惯性、空气动力学参数和鳍片延迟等动力学。其次,为了提高收敛速度和制导精度,在奖励函数的设计中引入了目标视线角速度的收敛因子和深度双滤波方法。然后将本文提出的方法与传统的比例导航进行比较。接下来,通过随机化对不同初始条件的高机动性目标进行了多次模拟。数值模拟结果表明,所提制导策略对空气动力学参数具有更高的制导精度和更强的鲁棒性和泛化能力。

著录项

获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号