IFAC Symposium on Automatic Control in Aerospace

Particle Guidance: Applying POMDPs to the Optimization of Mid-Course Guidance Laws for Long-Range Missiles

Abstract

During the mid-course phase of an air-to-air missile engagement, choosing the optimal Guidance Point (GP) so as to maximize lock-on success and minimize intercept time is critical. Given the low computational resources available on board and a very constrained maneuvering time frame, GP-based algorithms must be efficient. We suggest an innovative approach using Reinforcement Learning (RL) to produce finite state controllers that can be executed efficiently - using table lookup - to meet the strict time limits of a target engagement. Instead of hand-crafting a GP-picking algorithm for every combination of sensor and aircraft configuration, a promising alternative is to model a missile-target engagement as a Partially Observable Markov Decision Process (POMDP) and automatically generate a controller for picking the best GP by solving the POMDP model. Using a recently developed offline algorithm called Monte Carlo Value Iteration (MCVI), we constructed continuous-state POMDP models and solved them directly, without discretizing the entire state space.
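
The key implementation point in the abstract is that the controller produced offline (e.g. by MCVI) is a finite state controller, so online execution reduces to table lookup. The sketch below illustrates one plausible form of such table-lookup execution in Python; the class name, the node/transition tables, and the observation labels are illustrative assumptions, not the paper's implementation.

```python
import random

# Minimal sketch of executing a finite state controller (policy graph)
# produced offline by a POMDP solver. All names and data structures here
# are illustrative assumptions, not the paper's code.

class FiniteStateController:
    def __init__(self, node_actions, node_transitions, start_node=0):
        # node_actions[n]            -> action (e.g. a candidate Guidance Point label)
        # node_transitions[(n, obs)] -> next controller node
        self.node_actions = node_actions
        self.node_transitions = node_transitions
        self.node = start_node

    def act(self):
        # Online execution is a table lookup: O(1) per decision step.
        return self.node_actions[self.node]

    def update(self, observation):
        # Advance the controller node using the (discretized) sensor observation;
        # stay in the current node if no transition is defined for it.
        self.node = self.node_transitions.get((self.node, observation), self.node)


# Hypothetical two-node controller: command GP_0 until the sensor reports
# a likely target detection, then switch to GP_1.
controller = FiniteStateController(
    node_actions={0: "GP_0", 1: "GP_1"},
    node_transitions={(0, "no_detect"): 0, (0, "detect"): 1, (1, "detect"): 1},
)

for _ in range(5):
    gp = controller.act()                         # table lookup, no planning online
    obs = random.choice(["no_detect", "detect"])  # stand-in for a sensor return
    controller.update(obs)
    print(gp, obs)
```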
