Finite linear programming approximations of constrained discounted markov decision processes

Dufour F.; Prieto-Rumeau T.

首页> 外文期刊>SIAM Journal on Control and Optimization >Finite linear programming approximations of constrained discounted markov decision processes

【24h】

Finite linear programming approximations of constrained discounted markov decision processes

机译：约束折扣马尔可夫决策过程的有限线性规划近似

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

We consider a Markov decision process (MDP) with constraints under the total expected discounted cost optimality criterion. We are interested in proposing approximation methods of the optimal value of this constrained MDP. To this end, starting from the linear programming (LP) formulation of the constrained MDP (on an infinite-dimensional space of measures), we propose a finite state approximation of this LP problem. This is achieved by suitably approximating a probability measure underlying the random transitions of the dynamics of the system. Explicit convergence orders of the approximations of the optimal constrained cost are obtained. By exploiting convexity properties of the class of relaxed controls, we reduce the LP formulation of the constrained MDP to a finite-dimensional static optimization problem that can be used to obtain explicit numerical approximations of the corresponding optimal constrained cost. A numerical application illustrates our theoretical results.

机译：我们考虑在总预期折现成本最优性准则下具有约束的马尔可夫决策过程（MDP）。我们有兴趣提出这种受约束的MDP的最佳值的近似方法。为此，从约束MDP的线性规划（LP）公式（在度量的无穷维空间上）开始，我们提出了该LP问题的有限状态近似。这可以通过适当地近似系统动力学的随机过渡之下的概率测度来实现。获得最优约束成本近似值的显式收敛阶。通过利用松弛控制类的凸性，我们将约束MDP的LP公式简化为有限维静态优化问题，该问题可用于获得相应最佳约束成本的显式数值近似。数值应用说明了我们的理论结果。

著录项

来源
《SIAM Journal on Control and Optimization》 |2013年第2期|共27页
作者
Dufour F.; Prieto-Rumeau T.;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类运筹学;
关键词
Approximation of Markov decision processes; Constrained Markov decision processes; Linear programming approach to control problems; Quantization;

机译：马尔可夫决策过程的逼近;受约束的马尔可夫决策过程;控制问题的线性规划方法;量化;

相似文献

外文文献
中文文献
专利

1. Finite linear programming approximations of constrained discounted markov decision processes [J] . Dufour F., Prieto-Rumeau T. SIAM Journal on Control and Optimization . 2013,第2期

机译：约束折扣马尔可夫决策过程的有限线性规划近似
2. Finite-State Approximations to Discounted and Average Cost Constrained Markov Decision Processes [J] . Naci Saldi IEEE Transactions on Automatic Control . 2019,第7期

机译：折扣和平均成本约束的马尔可夫决策过程的有限状态近似
3. Conditions for the Solvability of the Linear Programming Formulation for Constrained Discounted Markov Decision Processes [J] . Dufour F., Prieto-Rumeau T. Applied mathematics and optimization . 2016,第1期

机译：约束折扣马尔可夫决策过程的线性规划公式的可解性条件
4. An application to the finite approximation of the first passage models for discrete-time Markov decision processes with varying discount factors [C] . Xiao Wu, Junyu Zhang World Congress on Intelligent Control and Automation . 2014

机译：可变折扣因子的离散时间马尔可夫决策过程在第一阶段模型有限逼近中的应用
5. Linear approximations for factored Markov decision processes. [D] . Patrascu, Relu-Eugen. 2005

机译：因子马尔可夫决策过程的线性近似。
6. Composition of Web Services Using Markov Decision Processes and Dynamic Programming [O] . Víctor Uc-Cetina, Francisco Moo-Mena, Rafael Hernandez-Ucan 2015

机译：使用Markov决策过程和动态规划的Web服务组合
7. Finite State Approximations for Countable State Infinite Horizon Discounted Markov Decision Processes [O] . Sjur D. Flåm 1987

机译：可数状态无限时空折扣马尔可夫决策过程的有限状态逼近

Finite linear programming approximations of constrained discounted markov decision processes

摘要

著录项

相似文献

相关主题

期刊订阅