RISK-SENSITIVE AVERAGE OPTIMALITY IN MARKOV DECISION PROCESSES

Sladky Karel

首页> 外文期刊>Kybernetika >RISK-SENSITIVE AVERAGE OPTIMALITY IN MARKOV DECISION PROCESSES

【24h】

RISK-SENSITIVE AVERAGE OPTIMALITY IN MARKOV DECISION PROCESSES

机译：马氏决策过程中的风险敏感平均最优

获取原文

获取原文并翻译 | 示例

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

团队文献服务 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this note attention is focused on finding policies optimizing risk-sensitive optimality criteria in Markov decision chains. To this end we assume that the total reward generated by the Markov process is evaluated by an exponential utility function with a given risk-sensitive coefficient. The ratio of the first two moments depends on the value of the risk-sensitive coefficient; if the risk-sensitive coefficient is equal to zero we speak on risk-neutral models. Observe that the first moment of the generated reward corresponds to the expectation of the total reward and the second central moment of the reward variance.For communicating Markov processes and for some specific classes of unichain processes long run risk-sensitive average reward is independent of the starting state. In this note we present necessary and sufficient condition for existence of optimal policies independent of the starting state in unichain models and characterize the class of average risk-sensitive optimal policies.

机译：在本文中，注意力集中在寻找在马尔可夫决策链中优化风险敏感最优标准的政策。为此，我们假设由马尔可夫过程产生的总回报是由具有给定风险敏感系数的指数效用函数评估的。前两个时刻的比率取决于风险敏感系数的值；如果风险敏感系数等于零，我们将使用风险中立模型。观察到，所产生的报酬的第一时刻对应于总报酬的期望值和报酬方差的第二中心时刻。对于沟通马尔可夫过程和某些特定类型的单链过程，长期的风险敏感平均报酬与起始状态。在本说明中，我们提出了与单链模型中的初始状态无关的最优策略的存在的充要条件，并描述了平均风险敏感型最优策略的类别。

著录项

来源
《Kybernetika 》 |2018年第6期| 1218-1230| 共13页
作者
Sladky Karel;
展开▼
作者单位

Czech Acad Sci, Inst Informat Theory & Automat, Vodarenskou Vezi 4, Prague 18208 8, Czech Republic;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
controlled Markov processes; finite state space; asymptotic behavior; risk-sensitive average optimality;

机译：受控马尔可夫过程有限状态空间渐近行为风险敏感平均最优性;

相似文献

外文文献
中文文献
专利

1. Risk-sensitive average optimality in Markov decision processes [J] . Karel Sladky Kybernetika . 2018 ,第6期

机译：马尔可夫决策过程中风险敏感的平均最优
2. Solution to the risk-sensitive average cost optimality equation in a class of Markov decision processes with finite state space [J] . Rolando Cavazos-Cadena Mathematical methods of operations research . 2003 ,第2期

机译：具有状态空间的一类马尔可夫决策过程中风险敏感的平均成本最优方程的求解
3. Optimality equations and inequalities in a class of risk-sensitive average cost Markov decision chains [J] . Cavazos-Cadena R Mathematical methods of operations research . 2010 ,第1期

机译：一类风险敏感的平均成本马尔可夫决策链中的最优性方程和不等式
4. On weak conditions and optimality inequality solutions in risk-sensitive controlled Markov processes with average criterion [C] . Brau-Rojas, A., Fernandez-Gaucherand, . 2002

机译：风险敏感的受控马尔可夫过程的弱条件和最优不等式解的平均判据
5. Controlled Markov chains with risk-sensitive average cost criterion. [D] . Brau Rojas, Agustin. 1999

机译：具有风险敏感平均成本准则的受控马尔可夫链。
6. Decision Making Under Uncertainty: A Neural Model Based on Partially Observable Markov Decision Processes [O] . Rajesh P. N. Rao 2010

机译：不确定性下的决策：基于部分可观察的马尔可夫决策过程的神经模型
7. Risk-sensitive optimal control for Markov decision processes with monotone cost [O] . V. S. Borkar, S. P. Meyn 2016

机译：具有单调成本的马尔可夫决策过程的风险敏感最优控制
8. On the Risk-Sensitive Optimality Criteria for Markov Decision Processes. [R] . sladky, karel 1975

机译：马尔可夫决策过程的风险敏感最优性准则。

RISK-SENSITIVE AVERAGE OPTIMALITY IN MARKOV DECISION PROCESSES

摘要

著录项

相似文献

相关主题

期刊订阅