Non-randomized policies for constrained Markov decision processes

Richard C. Chen; Eugene A. Feinberg

首页> 外文期刊>Mathematical Methods of Operations Research >Non-randomized policies for constrained Markov decision processes

【24h】

Non-randomized policies for constrained Markov decision processes

机译：约束马尔可夫决策过程的非随机策略

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper addresses constrained Markov decision processes, with expected discounted total cost criteria, which are controlled by non-randomized policies. A dynamic programming approach is used to construct optimal policies. The convergence of the series of finite horizon value functions to the infinite horizon value function is also shown. A simple example illustrating an application is presented.

机译：本文讨论了受约束的马尔可夫决策过程，该过程具有预期的折现总成本标准，该标准受非随机策略控制。动态规划方法用于构造最佳策略。还显示了一系列有限水平值函数向无限水平值函数的收敛性。给出了说明应用程序的简单示例。

著录项

来源
《Mathematical Methods of Operations Research》 |2007年第1期|165-179|共15页
作者
Richard C. Chen; Eugene A. Feinberg;
展开▼
作者单位

Radar Division Naval Research Laboratory Code 5341 Washington DC 20375 USA;

Department of Applied Mathematics and Statistics State University of New York Stony Brook NY 11794-3600 USA;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Constrained Markov; Decision processes; Dynamic programming; Non-randomized policies;

机译：约束马尔可夫;决策过程;动态规划;非随机策略;

相似文献

外文文献
中文文献
专利

1. Random search for constrained Markov decision processes with multi-policy improvement [J] . Chang Hyeong Soo Automatica . 2015,第Null期

机译：随机搜索约束多策略改进的马尔可夫决策过程
2. LINEAR PROGRAMMING AND CONSTRAINED AVERAGE OPTIMALITY FOR GENERAL CONTINUOUS-TIME MARKOV DECISION PROCESSES IN HISTORY-DEPENDENT POLICIES [J] . XIANPING GUO, YONGHUI HUANG, XINYUAN SONG SIAM Journal on Control and Optimization . 2012,第1期

机译：历史相关策略中一般连续时间马尔可夫决策过程的线性规划和约束平均最优性
3. Optimal policies for constrained average-cost Markov decision processes [J] . Gonzalez-Hernandez J, Villarreal CE TOP: An Official Journal of the Spanish Society of Statistics and Operations Research . 2011,第1期

机译：约束平均成本马尔可夫决策过程的最优策略
4. Non-randomized control of constrained Markov decision processes [C] . Chen R.C., Feinberg E.A. American Control Conference . 2006

机译：受约束的马尔可夫决策过程的非随机控制
5. Structural Results for Constrained Markov Decision Processes [D] . Girard, Cory Jay. 2018

机译：约束马尔可夫决策过程的结构结果
6. Evolving Robust Policy Coverage Sets in Multi-Objective Markov Decision Processes Through Intrinsically Motivated Self-Play [O] . Sherif Abdelfattah, Kathryn Kasmarik, Jiankun Hu 2018

机译：通过内在动机的自我博弈在多目标马尔可夫决策过程中发展稳健的政策覆盖范围
7. Sufficiency of stationary policies for constrained continuous-time Markov decision processes with total cost criteria [O] . Zhang, Yi 2014

机译：约束连续时间的固定策略的充分性具有总成本标准的马尔可夫决策过程

Non-randomized policies for constrained Markov decision processes

摘要

著录项

相似文献

相关主题

期刊订阅