Constrained Continuous-Time Markov Decision Processes on the Finite Horizon

Guo Xianping; Huang Yonghui; Zhang Yi


Abstract

This paper studies the constrained (nonhomogeneous) continuous-time Markov decision processes on the finite horizon. The performance criterion to be optimized is the expected total reward on the finite horizon, while N constraints are imposed on similar expected costs. Introducing the appropriate notion of the occupation measures for the concerned optimal control problem, we establish the following under some suitable conditions: (a) the class of Markov policies is sufficient; (b) every extreme point of the space of performance vectors is generated by a deterministic Markov policy; and (c) there exists an optimal Markov policy, which is a mixture of no more than N + 1 deterministic Markov policies.
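The problem described in the abstract can be written out as a rough sketch. The notation below is an assumption introduced only for illustration and is not taken from the paper: horizon $T$, state process $\xi_t$, action process $a_t$, reward rate $r_0$, cost rates $c_n$, constraint bounds $d_n$, and deterministic Markov policies $\varphi_k$. Any terminal reward or cost terms the paper may include are omitted here.

```latex
% Hypothetical notation: T, \xi_t, a_t, r_0, c_n, d_n, \varphi_k, p_k are illustrative assumptions.
\[
\begin{aligned}
\text{maximize}\quad
  & V_0(\pi) = \mathbb{E}^{\pi}\!\left[\int_0^T r_0(t,\xi_t,a_t)\,dt\right] \\
\text{subject to}\quad
  & V_n(\pi) = \mathbb{E}^{\pi}\!\left[\int_0^T c_n(t,\xi_t,a_t)\,dt\right] \le d_n,
  \qquad n = 1,\dots,N.
\end{aligned}
\]
% Result (c), read at the level of performance vectors V = (V_0, V_1, \dots, V_N):
\[
V(\pi^{*}) = \sum_{k=1}^{N+1} p_k\, V(\varphi_k),
\qquad p_k \ge 0,\quad \sum_{k=1}^{N+1} p_k = 1.
\]
```

Under this reading, the bound $N+1$ in result (c) matches the number of constraints plus one, and result (b) says the extreme points of the set of achievable vectors $V(\pi)$ come from deterministic Markov policies.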


Bibliographic details

Source: Applied Mathematics and Optimization, 2017, Issue 2, 25 pages
Authors: Guo Xianping; Huang Yonghui; Zhang Yi

Author affiliations:
Sun Yat Sen Univ, Sch Math & Computat Sci, Guangzhou 510275, Guangdong, Peoples R China;
Sun Yat Sen Univ, Sch Math & Computat Sci, Guangzhou 510275, Guangdong, Peoples R China;
Univ Liverpool, Dept Math Sci, Liverpool L69 7ZL, Merseyside, England

Original format: PDF
Language: English
Chinese Library Classification: Mathematical theory of optimization
Keywords: Continuous-time Markov decision process; Constrained-optimality; Finite horizon; Mixture of N+1 deterministic Markov policies; Occupation measure

