Journal: Computers & Operations Research

An approximate dynamic programming approach for comparing firing policies in a networked air defense environment


Abstract

An objective for effective air defense is to identify the firing policy for interceptor allocation to incoming missiles that minimizes the expected total damage to defended assets over a sequence of engagements. We formulate this dynamic weapon target assignment problem as a Markov decision process and utilize a simulation-based, approximate dynamic programming (ADP) approach to solve problem instances based on a representative scenario. Least squares policy evaluation and least squares temporal differences algorithms are developed to determine approximate solutions. A designed experiment investigates problem features such as conflict duration, attacker and defender weapon sophistication, and defended asset values. An empirical comparison of the ADP policies and two baseline policies (i.e., firing either one or two interceptors at each incoming theater ballistic missile (TBM)) yields several insights: the ADP policies outperform both baseline policies when conflict duration is short and attacker weapons are sophisticated; firing one interceptor at each TBM (regardless of inventory status) outperforms the tested ADP policies when conflict duration is long and attacker weapons are less sophisticated; and firing two interceptors at each TBM (regardless of inventory status), which is the United States Army's currently implemented policy, is never the superlative policy for the test instances investigated. Published by Elsevier Ltd.
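To make the two baseline policies concrete, here is a minimal sketch in Python of how "fire one" and "fire two" interceptors per TBM trade off against a limited inventory. It is a toy model and not the paper's formulation: it assumes one incoming TBM per engagement, independent shots with a common kill probability, and a fixed damage value for each missile that leaks through. The function name `expected_damage` and all of its parameters are hypothetical and chosen only for illustration.

```python
def expected_damage(shots_per_tbm, inventory, engagements, p_kill, asset_value):
    """Expected total damage under a fixed 'fire k per TBM' policy.

    Toy assumptions (not taken from the paper): one TBM arrives per
    engagement, each interceptor kills independently with probability
    p_kill, and a leaking TBM causes asset_value damage.
    """
    total = 0.0
    for _ in range(engagements):
        # The policy ignores inventory status, but cannot fire more
        # interceptors than remain.
        shots = min(shots_per_tbm, inventory)
        inventory -= shots
        # The TBM leaks through only if every interceptor fired misses.
        leak_prob = (1.0 - p_kill) ** shots
        total += asset_value * leak_prob
    return total


if __name__ == "__main__":
    for horizon in (2, 4):
        one = expected_damage(1, inventory=4, engagements=horizon,
                              p_kill=0.5, asset_value=1.0)
        two = expected_damage(2, inventory=4, engagements=horizon,
                              p_kill=0.5, asset_value=1.0)
        print(f"engagements={horizon}: fire-one={one}, fire-two={two}")
```

With these illustrative numbers (4 interceptors, kill probability 0.5), firing two per TBM gives lower expected damage over 2 engagements (0.5 versus 1.0), while firing one per TBM does better over 4 engagements (2.0 versus 2.5) because the fire-two policy exhausts its inventory early. This only illustrates why conflict duration can matter for a fixed policy; it does not reproduce the paper's experiments or its ADP policies.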
