首页> 外文OA文献 >Approximate Dynamic Programming and Stochastic Approximation Methods for Inventory Control and Revenue Management
【2h】

Approximate Dynamic Programming and Stochastic Approximation Methods for Inventory Control and Revenue Management

机译:库存控制和收入管理的近似动态规划和随机逼近方法

摘要

In this thesis, we develop approximate dynamic programming and stochastic approximation methods for problems in inventory control and revenue management. A unifying feature of the methods we develop is that they exploit the underlying problem structure. By doing so, we are able toestablish certain theoretical properties of our methods, make them more computationally efficient and obtain a faster rate ofconvergence.In the stochastic approximation framework, we develop an algorithm for the monotone estimation problem that uses a projection operator with respect to the max norm onto the order simplex. We show the almost sure convergence of this algorithm and present applications to the Q-learning algorithm and the newsvendor problem with censored demands. Next, we consider a number of inventory control problems for which the so-called base-stock policies are known to beoptimal. We propose stochastic approximation methods to compute the optimal base-stock levels. Existing methods in the literature have only local convergence guarantees. In contrast, we show that the iterates of our methods converge to base-stock levels that are globally optimal. Finally, we consider the revenue managementproblem of optimally allocating seats on a single flight leg todemands from multiple fare classes that arrive sequentially. Wepropose a stochastic approximation algorithm to compute the optimalprotection levels. The novel aspect of our method is that it workswith the nonsmooth version of the problem where capacity can only beallocated in integer quantities. We show that the iterates of ouralgorithm converge to the globally optimal protection levels.In the approximate dynamic programming framework, we use aLagrangian relaxation strategy to make the inventory controldecisions in a distribution system consisting of multiple retailersthat face random demand and a warehouse that supplies the retailers.Our method is based on relaxing the constraints that ensure thenonnegativity of the shipments to the retailers by associatingLagrange multipliers to them. We show that our method naturallyprovides a lower bound on the optimal objective value. Furthermore,a good set of Lagrange multipliers can be obtained bysolving a convex optimization problem.
机译:在本文中,我们针对库存控制和收益管理中的问题开发了近似动态规划和随机近似方法。我们开发的方法的一个统一特征是它们利用了潜在的问题结构。这样,我们就能建立我们方法的某些理论属性,使它们的计算效率更高,并获得更快的收敛速度。在随机逼近框架中,我们针对单调估计问题开发了一种算法,该算法使用投影算子将最大范数放到单纯形定单上。我们展示了该算法几乎可以肯定的收敛性,并将其应用于Q学习算法和带有审查需求的新闻供应商问题。接下来,我们考虑许多库存控制问题,对于这些问题,所谓的基本库存策略是最优的。我们提出了随机逼近方法来计算最佳基础库存水平。文献中的现有方法仅具有局部收敛性保证。相反,我们表明,我们的方法的迭代收敛到全局最优的基本库水平。最后,我们考虑了收益管理问题,即需要根据顺序到达的多个票价舱位在单个航班航段上最佳分配座位。我们提出了一种随机近似算法来计算最佳保护等级。我们方法的新颖之处在于它可以解决问题的非平滑版本,在该版本中,容量只能按整数分配。我们证明算法的迭代收敛到全局最优保护水平。在近似动态规划框架中,我们使用拉格朗日松弛策略在由多个面对随机需求的零售商和一个为零售商供货的仓库组成的分销系统中做出库存控制决策。我们的方法基于放宽约束,这些约束通过将拉格朗日乘数与零售商相关联来确保向零售商发货的负性。我们证明了我们的方法自然为最优目标值提供了一个下限。此外,通过解决凸优化问题,可以获得一组好的拉格朗日乘数。

著录项

  • 作者

    Kunnumkal Sumit;

  • 作者单位
  • 年度 2007
  • 总页数
  • 原文格式 PDF
  • 正文语种 en_US
  • 中图分类

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号