Optimal policy for minimizing risk models in Markov decision processes

Ohtsubo Y.; Toyonaga K.

首页> 外文期刊>Journal of Mathematical Analysis and Applications >Optimal policy for minimizing risk models in Markov decision processes

【24h】

Optimal policy for minimizing risk models in Markov decision processes

机译：最小化马尔可夫决策过程中风险模型的最佳策略

获取原文

获取原文并翻译 | 示例

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

团队文献服务 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

We consider the minimizing risk problems in discounted Markov decisions processes with countable state space and bounded general rewards. We characterize optimal values for finite and infinite horizon cases and give two sufficient conditions for the existence of an optimal policy in an infinite horizon case. These conditions are closely connected with Lemma 3 in White (1993), which is not correct as Wu and Lin (1999) point out We obtain a condition for the lemma to be true, under which we show that there is an optimal policy. Under another condition we show that an optimal value is a unique solution to some optimality equation and there is an optimal policy on a transient set. (C) 2002 Elsevier Science (USA). All rights reserved. [References: 14]

机译：我们考虑在具有可数状态空间和有限一般报酬的折现马尔可夫决策过程中将风险问题最小化。我们描述了有限和无限情况下的最优值，并为存在无限条件下的最优策略给出了两个充分条件。这些条件与怀特（1993）中的引理3紧密相关，正如吴和林（1999）指出的那样，这是不正确的。我们获得了一个引理为真的条件，在此条件下，我们表明存在最优策略。在另一个条件下，我们表明最优值是某些最优性方程的唯一解，并且在暂态集上存在最优策略。（C）2002 Elsevier Science（美国）。版权所有。 [参考：14]

著录项

来源
《Journal of Mathematical Analysis and Applications 》 |2002年第1期| 共16页
作者
Ohtsubo Y.; Toyonaga K.;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类数学 ;
关键词
Markov decision process; Minimizing risk model; Maximal fixed point; Existence of optimal policy; Variance;

机译：马尔可夫决策过程;最小化风险模型;最大固定点;最优策略的存在;方差;

相似文献

外文文献
中文文献
专利

1. Optimal policy for minimizing risk models in Markov decision processes [J] . Ohtsubo Y., Toyonaga K. Journal of Mathematical Analysis and Applications . 2002 ,第1期

机译：最小化马尔可夫决策过程中风险模型的最佳策略
2. Optimum inspection and maintenance policies for corroded structures using partially observable Markov decision processes and stochastic, physically based models [J] . K.G. Papakonstantinou, M. Shinozuka Probabilistic engineering mechanics . 2014 ,第jula期

机译：使用部分可观察的马尔可夫决策过程和基于物理的随机模型对腐蚀结构进行最佳检查和维护，
3. Spatial modelling of natural disaster risk reduction policies with Markov decision processes [J] . Espada Rodolfo Jr., Apan Armando, McDougall Kevin Applied Geography . 2014 ,第Null期

机译：利用马尔可夫决策过程进行自然灾害风险降低政策的空间建模
4. Minimizing Risk Models in Denumerable Semi-Markov Decision Processes with a Target Set [C] . HUANG Yonghui, GUO Xianping Proceedings of the 29th Chinese Control Conference . 2010

机译：具有目标集的可数半马尔可夫决策过程中的风险模型最小化
5. Modern Methods of Hidden Markov Models and Partially Observable Markov Decision Processes in Biostatistics [D] . Xu, Zekun. 2020

机译：隐藏马尔可夫模型的现代方法和止痛性的部分可观察马尔可夫决策过程
6. Evolving Robust Policy Coverage Sets in Multi-Objective Markov Decision Processes Through Intrinsically Motivated Self-Play [O] . Sherif Abdelfattah, Kathryn Kasmarik, Jiankun Hu 2018

机译：通过内在动机的自我博弈在多目标马尔可夫决策过程中发展稳健的政策覆盖范围
7. Optimal policy for minimizing risk models in Markov decision processes [O] . Ohtsubo Y., Toyonaga K. 2002

机译：最小化马尔可夫决策过程中风险模型的最佳策略
8. On the Risk-Sensitive Optimality Criteria for Markov Decision Processes. [R] . sladky, karel 1975

机译：马尔可夫决策过程的风险敏感最优性准则。

Optimal policy for minimizing risk models in Markov decision processes

摘要

著录项

相似文献

相关主题

期刊订阅