RISK-SENSITIVE DISCOUNTED CONTINUOUS-TIME MARKOV DECISION PROCESSES WITH UNBOUNDED RATES

Guo Xianping; Liao Zhong-Wei

首页> 外文期刊>SIAM Journal on Control and Optimization >RISK-SENSITIVE DISCOUNTED CONTINUOUS-TIME MARKOV DECISION PROCESSES WITH UNBOUNDED RATES

【24h】

RISK-SENSITIVE DISCOUNTED CONTINUOUS-TIME MARKOV DECISION PROCESSES WITH UNBOUNDED RATES

机译：风险敏感折扣连续时间马尔可夫决策流程，具有无限性率

获取原文

获取原文并翻译 | 示例

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper attempts to study the risk-sensitive discounted continuous-time Markov decision processes with unbounded transition and cost rates. Different from the case of bounded transition/cost rates, the optimality equation (OE) no longer has a solution satisfying the uniform convergence condition introduced in the existing literature. Thus, we first replace the uniform convergence condition of the solution with a new boundary condition. Then, we find mild conditions imposed on the primitive data of the decision processes, which not only ensure the existence of a solution to the OE but also are the generalization of the bounded transition/cost rates conditions. Furthermore, using the characterization of the boundary condition and a novel technique, from the OE we prove the existence of an optimal policy out of the class of randomized history-dependent policies. Finally, we present two examples with unbounded transition/cost rates to illustrate our results.

机译：本文试图研究风险敏感的折扣连续连续时间马尔可夫决策过程，具有无限的转换和成本率。与有界转变/成本速率的情况不同，最优性方程（OE）不再具有满足现有文献中引入的统一收敛条件的解决方案。因此，我们首先用新的边界条件更换溶液的均匀收敛条件。然后，我们发现对决策过程的原始数据施加的温和条件，这不仅可以确保对OE的解决方案的存在，而且还是界限转变/成本率条件的概括。此外，使用边界条件的表征和新技术，从OE中我们证明存在于随机历史依赖性策略类别中的最佳策略。最后，我们提出了两个具有无限转换/成本率的示例，以说明我们的结果。

著录项

来源
《SIAM Journal on Control and Optimization》 |2019年第6期|共27页
作者
Guo Xianping; Liao Zhong-Wei;
展开▼
作者单位

Sun Yat Sen Univ Sch Math Guangzhou Peoples R China;

South China Normal Univ South China Res Ctr Appl Math &

Interdisciplinary Guangzhou Peoples R China;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类运筹学;控制论、信息论（数学理论）;
关键词
continuous-time Markov decision process; unbounded transition and cost rates; risk-sensitive discounted optimality; the optimality equation; Foster-Lyapunov and logarithm growth conditions;

机译：连续时间马尔可夫决策过程;无限的过渡和成本率;风险敏感的折扣最优性;最优性方程;福斯特 - Lyapunov和对数生长条件;

相似文献

外文文献
中文文献
专利

1. RISK-SENSITIVE DISCOUNTED CONTINUOUS-TIME MARKOV DECISION PROCESSES WITH UNBOUNDED RATES [J] . Guo Xianping, Liao Zhong-Wei SIAM Journal on Control and Optimization . 2019,第6期

机译：风险敏感折扣连续时间马尔可夫决策流程，具有无限性率
2. Finite horizon risk-sensitive continuous-time Markov decision processes with unbounded transition and cost rates [J] . Guo Xin, Liu Qiuli, Zhang Yi 4OR: Quarterly Journal of the Belgian, French and Italian Operations Research Societies . 2019,第4期

机译：有限地平线风险敏感的连续时间马尔可夫决策流程，具有无限的过渡和成本率
3. Discounted continuous-time Markov decision processes with unbounded rates and randomized history-dependent policies: the dynamic programming approach [J] . Alexey Piunovskiy, Yi Zhang 4OR: Quarterly Journal of the Belgian, French and Italian Operations Research Societies . 2014,第1期

机译：具有无限制利率和依赖历史的随机策略的折扣连续时间马尔科夫决策过程：动态规划方法
4. Iterated risk measures for risk-sensitive Markov decision processes with discounted cost [C] . Takayuki Osogami Uncertainty in artificial intelligence . 2011

机译：成本敏感的马尔可夫决策过程的迭代风险度量
5. Investigation of Computational Reduction Strategies for Markov Decision Processes [D] . Zhai, Jie. 2019

机译：马尔可夫决策过程计算减排策略调查
6. Learning to maximize reward rate: a model based on semi-Markov decision processes [O] . Arash Khodadadi, Pegah Fakhari, Jerome R. Busemeyer 2014

机译：学习最大化奖励率：基于半马尔可夫决策过程的模型
7. Finite horizon risk-sensitive continuous-time Markov decision processes with unbounded transition and cost rates [O] . Xin Guo, Qiuli Liu, Yi Zhang 2019

机译：有限地平线风险敏感的连续时间马尔可夫决策流程，具有无限的过渡和成本率
8. Countable State Discounted Markovian Decision Processes with Unbounded Rewards [R] . Harrison, J. M. 1970

机译：具有无限奖励的可数州折现马尔可夫决策过程

RISK-SENSITIVE DISCOUNTED CONTINUOUS-TIME MARKOV DECISION PROCESSES WITH UNBOUNDED RATES

摘要

著录项

相似文献

相关主题

期刊订阅