FIRST PASSAGE RISK PROBABILITY OPTIMALITY FOR CONTINUOUS TIME MARKOV DECISION PROCESSES

Huo Haifeng; Wen Xian

首页> 外文期刊>Kybernetika >FIRST PASSAGE RISK PROBABILITY OPTIMALITY FOR CONTINUOUS TIME MARKOV DECISION PROCESSES

【24h】

FIRST PASSAGE RISK PROBABILITY OPTIMALITY FOR CONTINUOUS TIME MARKOV DECISION PROCESSES

机译：第一次通行风险概率概率最优，用于连续时间马尔可夫决策过程

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper, we study continuous time Markov decision processes (CTMDPs) with a denumerable state space, a Borel action space, unbounded transition rates and nonnegative reward function. The optimality criterion to be considered is the first passage risk probability criterion. To ensure the non-explosion of the state processes, we first introduce a so-called drift condition, which is weaker than the well known regular condition for semi-Markov decision processes (SMDPs). Furthermore, under some suitable conditions, by value iteration recursive approximation technique, we establish the optimality equation, obtain the uniqueness of the value function and the existence of optimal policies. Finally, two examples are used to illustrate our results.

机译：在本文中，我们研究了连续时间马尔可夫决策过程（CTMDPS），具有可降价的状态空间，Borel Action Space，无绑定的过渡率和非负奖励功能。要考虑的最优标准是第一段段风险概率标准。为了确保国家流程的非爆炸，我们首先引入所谓的漂移条件，这比半马尔可夫决策过程（SMDPS）的众所周知的常规条件弱。此外，在一些合适的条件下，通过价值迭代递归近似技术，我们建立了最优性方程，获得了价值函数的唯一性和最佳策略的存在。最后，使用两个例子来说明我们的结果。

著录项

来源
《Kybernetika》 |2019年第1期|114-133|共20页
作者
Huo Haifeng; Wen Xian;
展开▼
作者单位

Guangxi Univ Sci & Technol Sch Sci Liuzhou 545006 Peoples R China;

Guangxi Univ Sci & Technol Sch Sci Liuzhou 545006 Peoples R China|Guangxi Univ Sci & Technol Lushan Coll Liuzhou 5450616 Peoples R China;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
continuous time Markov decision processes; first passage time; risk probability criterion; optimal policy;

机译：连续时间马尔可夫决策过程;第一次通过时间;风险概率标准;最优政策;

相似文献

外文文献
中文文献
专利

1. FIRST PASSAGE RISK PROBABILITY OPTIMALITY FOR CONTINUOUS TIME MARKOV DECISION PROCESSES [J] . Huo Haifeng, Wen Xian Kybernetika . 2019,第1期

机译：连续时间马尔可夫决策过程的第一个通道风险概率最优性
2. ON THE FIRST PASSAGE g-MEAN-VARIANCE OPTIMALITY FOR DISCOUNTED CONTINUOUS-TIME MARKOV DECISION PROCESSES [J] . Guo Xianping, Huang Xiangxiang, Zhang Yi SIAM Journal on Control and Optimization . 2015,第3期

机译：连续马尔可夫决策过程的第一阶段g均值最优性研究
3. First Passage Optimality for Continuous-Time Markov Decision Processes With Varying Discount Factors and History-Dependent Policies [J] . Guo X., Song X., Zhang Y. IEEE Transactions on Automatic Control . 2014,第1期

机译：可变折扣因子和历史相关策略的连续时间马尔可夫决策过程的第一遍最优性
4. Time-Bounded Reachability Probabilities in Continuous-Time Markov Decision Processes [C] . Neuhausser Martin R., Zhang Lijun Seventh International Conference on the Quantitative Evaluation of Systems . 2010

机译：连续时间马尔可夫决策过程中的时间可及性概率
5. Risk -sensitive control of discrete -time partially observed Markov decision processes. [D] . Chuang, Dong-Ming. 1999

机译：离散时间部分观察到的马尔可夫决策过程的风险敏感控制。
6. General continuous-time Markov model of sequence evolution via insertions/deletions: are alignment probabilities factorable? [O] . Kiyoshi Ezawa 2016

机译：通过插入/缺失进行序列进化的一般连续时间马尔可夫模型：比对概率可分解吗？
7. On the First Passage $g$-Mean-Variance Optimality for Discounted Continuous-Time Markov Decision Processes [O] . Guo X, Huang X, Zhang Y 2015

机译：贴现连续时间马尔可夫决策过程的第一遍$ g $-均值最优性
8. On the Risk-Sensitive Optimality Criteria for Markov Decision Processes. [R] . sladky, karel 1975

机译：马尔可夫决策过程的风险敏感最优性准则。

FIRST PASSAGE RISK PROBABILITY OPTIMALITY FOR CONTINUOUS TIME MARKOV DECISION PROCESSES

摘要

著录项

相似文献

相关主题

期刊订阅