Journal: Systems Engineering and Electronics (系统工程与电子技术)

Q-Learning Stealthy Engagement Strategy for Air Vehicles Based on an RBF Neural Network

     

Abstract

Within the Markov decision process framework, this paper studies a reinforcement learning method for stealthy engagement strategies of air vehicles in three-dimensional space, and defines the advantage region and the exposure region in the environment model. To address the curse of dimensionality that arises when learning a policy over a high-dimensional state space, a Q-learning algorithm based on a radial basis function neural network (RBFNN) is presented, and a ranked sampling method for the training samples is described. Simulations of engagement maneuver strategy learning are carried out for two different situations. The results show that, with a reasonable ranked sampling method, the RBFNN-based Q-learning algorithm can effectively generate stealthy engagement strategies.
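The abstract does not give the network structure, state variables, reward, or sampling scheme, so the following is only a generic sketch of Q-learning with a Gaussian RBF approximator, not the authors' implementation. It assumes fixed RBF centers, a single shared width, one linear weight vector per action, and a plain temporal-difference update. The class name `RBFQ` and all parameter names are hypothetical.

```python
import math


class RBFQ:
    """Q(s, a) as a weighted sum of Gaussian RBF features (illustrative only)."""

    def __init__(self, centers, width, n_actions, alpha=0.1, gamma=0.9):
        self.centers = centers    # assumed fixed centers, each a tuple in state space
        self.width = width        # assumed shared Gaussian width
        self.alpha = alpha        # learning rate
        self.gamma = gamma        # discount factor
        # One weight per (action, center) pair, initialised to zero.
        self.w = [[0.0] * len(centers) for _ in range(n_actions)]

    def features(self, s):
        """Gaussian activation of every center for state s."""
        two_sigma_sq = 2.0 * self.width ** 2
        return [
            math.exp(-sum((x - c) ** 2 for x, c in zip(s, ctr)) / two_sigma_sq)
            for ctr in self.centers
        ]

    def q(self, s, a):
        """Approximate action value: dot product of weights and features."""
        return sum(w * f for w, f in zip(self.w[a], self.features(s)))

    def greedy(self, s):
        """Action with the largest approximate Q-value in state s."""
        return max(range(len(self.w)), key=lambda a: self.q(s, a))

    def update(self, s, a, r, s_next, done):
        """One Q-learning step; returns the TD error before the update."""
        if done:
            target = r
        else:
            target = r + self.gamma * max(
                self.q(s_next, b) for b in range(len(self.w))
            )
        delta = target - self.q(s, a)
        # Gradient of Q with respect to the weights is the feature vector.
        for i, f in enumerate(self.features(s)):
            self.w[a][i] += self.alpha * delta * f
        return delta
```

Because the number of weights grows with the number of centers rather than with a discretised grid over every state dimension, an approximator of this kind is one common way to sidestep the curse of dimensionality the abstract mentions. How the paper places its centers and selects training samples (its ranked sampling) is not described here and is left out of the sketch.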
