...
首页> 外文期刊>SIAM Journal on Control and Optimization >Finite linear programming approximations of constrained discounted markov decision processes
【24h】

Finite linear programming approximations of constrained discounted markov decision processes

机译:约束折扣马尔可夫决策过程的有限线性规划近似

获取原文
获取原文并翻译 | 示例
           

摘要

We consider a Markov decision process (MDP) with constraints under the total expected discounted cost optimality criterion. We are interested in proposing approximation methods of the optimal value of this constrained MDP. To this end, starting from the linear programming (LP) formulation of the constrained MDP (on an infinite-dimensional space of measures), we propose a finite state approximation of this LP problem. This is achieved by suitably approximating a probability measure underlying the random transitions of the dynamics of the system. Explicit convergence orders of the approximations of the optimal constrained cost are obtained. By exploiting convexity properties of the class of relaxed controls, we reduce the LP formulation of the constrained MDP to a finite-dimensional static optimization problem that can be used to obtain explicit numerical approximations of the corresponding optimal constrained cost. A numerical application illustrates our theoretical results.
机译:我们考虑在总预期折现成本最优性准则下具有约束的马尔可夫决策过程(MDP)。我们有兴趣提出这种受约束的MDP的最佳值的近似方法。为此,从约束MDP的线性规划(LP)公式(在度量的无穷维空间上)开始,我们提出了该LP问题的有限状态近似。这可以通过适当地近似系统动力学的随机过渡之下的概率测度来实现。获得最优约束成本近似值的显式收敛阶。通过利用松弛控制类的凸性,我们将约束MDP的LP公式简化为有限维静态优化问题,该问题可用于获得相应最佳约束成本的显式数值近似。数值应用说明了我们的理论结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号