Journal: Naval Research Logistics

A novel use of value iteration for deriving bounds for threshold and switching curve optimal policies

Abstract

In this article, we develop a novel role for the initial function v(0) in the value iteration algorithm. When the optimal policy of a countable-state Markovian queueing control problem has a threshold or switching curve structure, we conjecture that one can tune the choice of v(0) to generate monotonic sequences of n-stage threshold or switching curve optimal policies. We show this for three queueing control models: the M/M/1 queue with admission control, the M/M/1 queue with service control, and the two-competing-queues model with quadratic holding cost. As a consequence, we obtain increasingly tight upper and lower bounds on the optimal threshold or switching curve. After a finite number of iterations, either the optimal threshold or the optimal switching curve values in a finite number of states are available. This procedure can be used to increase numerical efficiency.
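To make the role of v(0) concrete, the following is a minimal sketch of value iteration for an M/M/1 admission-control problem that records the n-stage rejection threshold at every iteration. It is not the article's construction. The discounted cost criterion, the truncation of the state space at N, the uniformization with lam + mu = 1, all parameter values, and both choices of v(0) are assumptions made purely for illustration. The function name `n_stage_thresholds` is hypothetical.

```python
# Sketch only: a truncated, uniformized, discounted M/M/1 admission-control model.
# Every parameter value and both v(0) choices below are illustrative assumptions,
# not taken from the article.

def n_stage_thresholds(v0, lam=0.4, mu=0.6, hold=1.0, reject=10.0,
                       beta=0.95, n_iter=50):
    """Run value iteration from v0 and return the list of n-stage thresholds.

    The threshold recorded at each stage is the smallest state x in which
    rejecting an arrival costs no more than admitting it. The value N means
    that no state below the truncation level rejects.
    """
    N = len(v0) - 1
    v = list(v0)
    thresholds = []
    for _ in range(n_iter):
        # One-step-optimal admission decision given continuation values v.
        t = N
        for x in range(N):
            if reject + v[x] <= v[x + 1]:
                t = x
                break
        thresholds.append(t)

        # Bellman update: holding cost plus discounted expected continuation.
        new_v = []
        for x in range(N + 1):
            if x < N:
                arrival = min(reject + v[x], v[x + 1])  # reject vs. admit
            else:
                arrival = reject + v[x]  # forced rejection at the truncation level
            departure = v[max(x - 1, 0)]  # a service completion in state 0 is a dummy event
            new_v.append(hold * x + beta * (lam * arrival + mu * departure))
        v = new_v
    return thresholds


N = 40
# v(0) = 0: admitting looks free at first, so the first threshold is the truncation level.
from_zero = n_stage_thresholds([0.0] * (N + 1))
# A steep quadratic v(0): rejecting looks attractive everywhere, so the first threshold is 0.
from_steep = n_stage_thresholds([10.0 * x * x for x in range(N + 1)])

print(from_zero[:10])
print(from_steep[:10])
```

The two runs start from opposite extremes, one admitting in every state and one rejecting in every state, which is the kind of starting point from which upper and lower bounds on the optimal threshold could be generated. Whether these particular initial functions actually yield monotone threshold sequences is not established by this sketch; that property is the article's claim for its own choices of v(0).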
