...
首页> 外文期刊>Annals of Operations Research >Approximate dynamic programming for the military inventory routing problem
【24h】

Approximate dynamic programming for the military inventory routing problem

机译:军事库存路由问题的近似动态规划

获取原文
获取原文并翻译 | 示例
           

摘要

The United States Army can benefit from effectively utilizing cargo unmanned aerial vehicles (CUAVs) to perform resupply operations in combat environments to reduce the use of manned (ground and aerial) resupply that incurs risk to personnel. We formulate a Markov decision process (MDP) model of an inventory routing problem (IRP) with vehicle loss and direct delivery, which we label the military IRP (MILIRP). The objective of the MILIRP is to determine CUAV dispatching and routing policies for the resupply of geographically dispersed units operating in an austere, combat environment. The large size of the problem instance motivating this research renders dynamic programming algorithms inappropriate, so we utilize approximate dynamic programming (ADP) methods to attain improved policies (relative to a benchmark policy) via an approximate policy iteration algorithmic strategy utilizing least squares temporal differencing for policy evaluation. We examine a representative problem instance motivated by resupply operations experienced by the United States Army in Afghanistan both to demonstrate the applicability of our MDP model and to examine the efficacy of our proposed ADP solution methodology. A designed computational experiment enables the examination of selected problem features and algorithmic features vis-a-vis the quality of solutions attained by our ADP policies. Results indicate that a 4-crew, 8-CUAV unit is able to resupply 57% of the demand from an 800-person organization over a 3-month time horizon when using the ADP policy, a notable improvement over the 18% attained using a benchmark policy. Such results inform the development of procedures governing the design, development, and utilization of CUAV assets for the resupply of dispersed ground combat forces.
机译:美国陆军可以从有效利用货物无人驾驶航空公司(CUAV)中,以便在战斗环境中进行再生操作,以减少载人(地面和空中)的使用,再次冒险冒险。我们制定了具有车辆损失和直接交付的库存路由问题(IRP)的马尔可夫决策过程(MDP)模型,我们标记了军事IRP(Milirp)。 MILIRP的目的是确定CUAV调度和路由政策,用于在AUSTERE,战斗环境中运行的地理上分散单元的补充。问题实例的大尺寸激励本研究呈现动态编程算法不合适,因此我们利用近似的动态编程(ADP)方法通过近似政策迭代算法策略利用最小二乘时间差异来实现改进的策略(相对于基准策略)政策评估。我们审查了由美国陆军在阿富汗经历的复苏业务的代表性问题,以证明我们的MDP模型的适用性,并检查我们提出的ADP解决方案方法的疗效。设计的计算实验可以检查选定的问题特征和算法功能VIS-A-VIS,我们的ADP政策所获得的解决方案质量。结果表明,在使用ADP政策时,一家4册8-CUAV单位能够在3个月的时间范围内再次从800人组织中排斥57%的需求,超过18%的显着改善基准政策。这些结果可通知管理CUAV资产的设计,开发和利用的程序,了解分散的地面战斗力的补给。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号