FIRST PASSAGE RISK PROBABILITY OPTIMALITY FOR CONTINUOUS TIME MARKOV DECISION PROCESSES

Huo Haifeng; Wen Xian

首页> 外文期刊>Kybernetika >FIRST PASSAGE RISK PROBABILITY OPTIMALITY FOR CONTINUOUS TIME MARKOV DECISION PROCESSES

【24h】

FIRST PASSAGE RISK PROBABILITY OPTIMALITY FOR CONTINUOUS TIME MARKOV DECISION PROCESSES

机译：连续时间马尔可夫决策过程的第一个通道风险概率最优性

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper, we study continuous time Markov decision processes (CTMDPs) with a denumerable state space, a Borel action space, unbounded transition rates and nonnegative reward function. The optimality criterion to be considered is the first passage risk probability criterion. To ensure the non-explosion of the state processes, we first introduce a so-called drift condition, which is weaker than the well known regular condition for semi-Markov decision processes (SMDPs). Furthermore, under some suitable conditions, by value iteration recursive approximation technique, we establish the optimality equation, obtain the uniqueness of the value function and the existence of optimal policies. Finally, two examples are used to illustrate our results.

机译：在本文中，我们研究了具有可数状态空间，Borel作用空间，无界转移率和非负奖励函数的连续时间马尔可夫决策过程（CTMDP）。要考虑的最佳标准是首次通过风险概率标准。为了确保状态过程不爆炸，我们首先引入所谓的漂移条件，该条件比半马氏决策过程（SMDP）的众所周知的常规条件要弱。此外，在一些合适的条件下，通过值迭代递推逼近技术，建立了最优性方程，得到了价值函数的唯一性和最优策略的存在性。最后，使用两个示例来说明我们的结果。

著录项

来源
《Kybernetika》 |2019年第1期|114-133|共20页
作者
Huo Haifeng; Wen Xian;
展开▼
作者单位

Guangxi Univ Sci & Technol, Sch Sci, Liuzhou 545006, Peoples R China;

Guangxi Univ Sci & Technol, Sch Sci, Liuzhou 545006, Peoples R China|Guangxi Univ Sci & Technol, Lushan Coll, Liuzhou 5450616, Peoples R China;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
continuous time Markov decision processes; first passage time; risk probability criterion; optimal policy;

机译：连续时间马尔可夫决策过程;初次通过时间;风险概率准则;最优策略;

相似文献

外文文献
中文文献
专利

1. FIRST PASSAGE RISK PROBABILITY OPTIMALITY FOR CONTINUOUS TIME MARKOV DECISION PROCESSES [J] . Huo Haifeng, Wen Xian Kybernetika . 2019,第1期

机译：第一次通行风险概率概率最优，用于连续时间马尔可夫决策过程
2. ON THE FIRST PASSAGE g-MEAN-VARIANCE OPTIMALITY FOR DISCOUNTED CONTINUOUS-TIME MARKOV DECISION PROCESSES [J] . Guo Xianping, Huang Xiangxiang, Zhang Yi SIAM Journal on Control and Optimization . 2015,第3期

机译：连续马尔可夫决策过程的第一阶段g均值最优性研究
3. First Passage Optimality for Continuous-Time Markov Decision Processes With Varying Discount Factors and History-Dependent Policies [J] . Guo X., Song X., Zhang Y. IEEE Transactions on Automatic Control . 2014,第1期

机译：可变折扣因子和历史相关策略的连续时间马尔可夫决策过程的第一遍最优性
4. Time-Bounded Reachability Probabilities in Continuous-Time Markov Decision Processes [C] . Neuhausser Martin R., Zhang Lijun Seventh International Conference on the Quantitative Evaluation of Systems . 2010

机译：连续时间马尔可夫决策过程中的时间可及性概率
5. Risk -sensitive control of discrete -time partially observed Markov decision processes. [D] . Chuang, Dong-Ming. 1999

机译：离散时间部分观察到的马尔可夫决策过程的风险敏感控制。
6. General continuous-time Markov model of sequence evolution via insertions/deletions: are alignment probabilities factorable? [O] . Kiyoshi Ezawa 2016

机译：通过插入/缺失进行序列进化的一般连续时间马尔可夫模型：比对概率可分解吗？
7. On the First Passage $g$-Mean-Variance Optimality for Discounted Continuous-Time Markov Decision Processes [O] . Guo X, Huang X, Zhang Y 2015

机译：贴现连续时间马尔可夫决策过程的第一遍$ g $-均值最优性
8. On the Risk-Sensitive Optimality Criteria for Markov Decision Processes. [R] . sladky, karel 1975

机译：马尔可夫决策过程的风险敏感最优性准则。

FIRST PASSAGE RISK PROBABILITY OPTIMALITY FOR CONTINUOUS TIME MARKOV DECISION PROCESSES

摘要

著录项

相似文献

相关主题

期刊订阅