Journal: Mathematics of Operations Research

Optimality inequalities for average cost Markov decision processes and the stochastic cash balance problem

Abstract

For Markov decision processes with general state and action spaces, we present sufficient conditions for the existence of solutions to the average cost optimality inequalities. These conditions also imply that the optimal discounted cost value functions and policies converge to the corresponding objects for the average cost per unit time criterion. Inventory models are natural applications of our results. We describe structural properties of average cost optimal policies for the cash balance problem, an inventory control problem in which demand may be negative and the decision-maker can produce or scrap inventory. We also show that the optimal thresholds for the finite horizon problem converge to those under the expected discounted cost criterion, and that the thresholds under the expected discounted cost criterion converge to those under the average cost per unit time criterion.
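
For orientation, the average cost optimality inequality named in the abstract is usually written in the following form. This is a sketch in assumed notation, not taken from the paper itself: the symbols X (state space), A(x) (actions available at x), c (one-step cost), q (transition kernel), v_α (optimal discounted value function), w and u are all introduced here for illustration.

```latex
% Discounted cost optimality equation, discount factor \alpha \in (0,1):
v_\alpha(x) = \min_{a \in A(x)} \Big[ c(x,a) + \alpha \int_X v_\alpha(y)\, q(dy \mid x,a) \Big],
\qquad x \in X.

% Average cost optimality inequality: a constant w and a function u with
w + u(x) \;\ge\; \min_{a \in A(x)} \Big[ c(x,a) + \int_X u(y)\, q(dy \mid x,a) \Big],
\qquad x \in X.
```

In the vanishing discount approach, candidates for w and u are typically built from limits of (1 - α) v_α and of the relative values v_α(x) - inf_y v_α(y) as α tends to 1. The precise conditions under which such limits exist and satisfy the inequality are the subject of the paper and are not reproduced here.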