首页> 外文期刊>Mathematics of operations research >A cost-shaping linear program for average-cost approximate dynamic programming with performance guarantees
【24h】

A cost-shaping linear program for average-cost approximate dynamic programming with performance guarantees

机译:具有性能保证的平均成本近似动态规划的成本成形线性程序

获取原文
获取原文并翻译 | 示例
           

摘要

We introduce a new algorithm based on linear programming for optimization of average-cost Markov decision processes (MDPs). The algorithm approximates the differential cost function of a perturbed MDP via a linear combination of basis functions. We establish a bound on the performance of the resulting policy that scales gracefully with the number of states without imposing the strong Lyapunov condition required by its counterpart in de Farias and Van Roy (de Farias, D. R, B. Van Roy. 2003. The linear programming approach to approximate dynamic programming. Oper Res. 51(6) 850-865]. We investigate implications of this result in the context of a queueing control problem.
机译:我们介绍了一种基于线性规划的新算法,用于优化平均成本马尔可夫决策过程(MDP)。该算法通过基本函数的线性组合来近似被摄动的MDP的微分成本函数。我们对由此产生的政策的执行情况设定了一个界限,该政策可以随州数的增加而适当地扩展,而不会强加de Farias和Van Roy的对手所要求的强Lyapunov条件(de Farias,D.R,B.Van Roy.2003。线性规划方法逼近动态规划,Oper Res。51(6)850-865],我们研究了在排队控制问题中该结果的含义。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号