Finite-state approximations to constrained Markov decision processes with Borel spaces

机译：具有Borel空间的约束Markov决策过程的有限状态近似

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

We consider the finite-state approximation of a discrete-time constrained Markov decision process with compact state space, under the discounted cost criterion. Using the linear programming formulation of the constrained problem, we prove the convergence of the optimal value function of the finite-state model to the optimal value function of the original model. Under further continuity condition on the transition probability of the original model, we also establish a method to compute approximately optimal policies.

机译：我们考虑了在贴现成本准则下具有紧凑状态空间的离散时间约束马尔可夫决策过程的有限状态近似。使用约束问题的线性规划公式，我们证明了有限状态模型的最优值函数与原始模型的最优值函数的收敛性。在原始模型的转移概率具有进一步连续性的条件下，我们还建立了一种计算近似最优策略的方法。

著录项

来源
《Annual Allerton Conference on Communication, Control, and Computing》|2015年|567-572|共6页
会议地点
作者
Naci Saldi; Serdar Yksel; Tams Linder;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Constrained Markov decision processes; finite-state approximation; quantization; stochastic control;

机译：约束马尔可夫决策过程有限状态逼近量化随机控制;

相似文献

外文文献
中文文献
专利

1. Finite-State Approximations to Discounted and Average Cost Constrained Markov Decision Processes [J] . Naci Saldi IEEE Transactions on Automatic Control . 2019,第7期

机译：折扣和平均成本约束的马尔可夫决策过程的有限状态近似
2. On the Asymptotic Optimality of Finite Approximations to Markov Decision Processes with Borel Spaces [J] . Saldi Naci, Yuksel Serdar, Linder Tamas Mathematics of operations research . 2017,第4期

机译：关于与Borel空间的有限近似的渐近最优性
3. Computable approximations for continuous-time Markov decision processes on Borel spaces based on empirical measures [J] . Anselmi Jonatha, Dufour Francois, Prieto-Rumeau Tomas Journal of Mathematical Analysis and Applications . 2016,第2期

机译：Borel空间上基于经验测度的连续时间马尔可夫决策过程的可计算近似
4. Finite-state approximations to constrained Markov decision processes with Borel spaces [C] . Naci Saldi, Serdar Y??ksel, Tam??s Linder Annual Allerton Conference on Communication, Control, and Computing . 2015

机译：有限状态近似与Borel空间的受限Markov决策过程
5. Linear approximations for factored Markov decision processes. [D] . Patrascu, Relu-Eugen. 2005

机译：因子马尔可夫决策过程的线性近似。
6. Data-Driven Markov Decision Process Approximations for PersonalizedHypertension Treatment Planning [O] . Greggory J. Schell, Wesley J. Marrero, Mariel S. Lavieri, 2016

机译：数据驱动的个性化马尔可夫决策过程近似高血压治疗计划
7. Asymptotic Optimality of Finite Approximations to Markov Decision Processes with Borel Spaces [O] . Saldi, Naci, Yüksel, Serdar, Linder, Tamás 2016

机译：马尔可夫决策有限逼近的渐近最优性具有Borel空间的过程
8. Blackwell Optimality in the Class of All Policies in Markov Decision Chains witha Borel State Space and Unbounded Rewards [R] . Hordijk, A., Yushkevich, A. A. 2000

机译：具有Borel状态空间和无界奖励的马尔可夫决策链中所有策略类的Blackwell最优性

Finite-state approximations to constrained Markov decision processes with Borel spaces

摘要

著录项

相似文献

相关主题

期刊订阅