首页> 外文会议>IEEE Annual Conference on Decision and Control >On the average-cost optimality equations and convergence of discounted-cost relative value functions for inventory control problems with quasiconvex cost functions
【24h】

On the average-cost optimality equations and convergence of discounted-cost relative value functions for inventory control problems with quasiconvex cost functions

机译:具有拟凸成本函数的库存控制问题的平均成本最优方程和折现成本相对值函数的收敛性

获取原文

摘要

Average-cost optimality inequalities imply the existence of stationary optimal policies for Markov Decision Processes with average costs per unit time, and these inequalities hold under broad natural conditions. Additional conditions are required for the validity of the average-cost optimality equations. Recently Feinberg and Liang [10, Theorem 3.2] showed that the equicontinuity of value functions for discounted costs is sufficient additional condition for the validity of average-cost optimality equations for problems with weakly continuous transition probabilities and with possibly unbounded one-step costs, and this condition holds for setup-cost inventory control problems with backorders and convex holding/backlog costs. This paper studies periodic-review setup-cost inventory control problem with backorders and with quasiconvex cost functions and general demands. It is shown that such problems satisfy the equicontinuity condition. Therefore, optimality inequalities hold in the form of equalities with a continuous average-cost relative value function for this problem. In addition, this implies that average-cost optimal (s, S) policies for the inventory control problem can be derived from the average-cost optimality equation. With the additional assumption on the monotonicity of the cost function, we establish the convergence of discounted-cost optimal ordering threshold sa and convergence of discounted-cost relative value functions, when the discount factor converges to 1, to the corresponding optimal threshold and optimal relative value function for the average-cost problem.
机译:平均成本最优不等式意味着存在马尔可夫决策过程的平稳最优策略,其平均时间为每单位时间,并且这些不等式在广泛的自然条件下成立。平均成本最优性方程的有效性还需要其他条件。最近,Feinberg和Liang [10,定理3.2]表明,折现成本的价值函数的等连续性是平均成本最优性方程对于弱连续转移概率和可能具有无穷一步成本的问题的有效性的充分附加条件,并且这种情况适用于带有延期交货和凸面持有/积压成本的设置成本库存控制问题。本文研究了带有延期交货,准凸成本函数和一般需求的定期审查设置成本库存控制问题。结果表明,这些问题满足等连续性条件。因此,对于这个问题,最优性不平等以具有连续平均成本相对值函数的等式形式存在。此外,这意味着可以从平均成本最优方程得出库存控制问题的平均成本最优(s,S)策略。在成本函数单调性的附加假设下,当折扣因子收敛至1时,我们建立了折扣成本最优排序阈值s a 的收敛性和折扣成本相对价值函数的收敛性。平均成本问题的相应最佳阈值和最佳相对值函数。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号