首页> 外文期刊>Mathematical Methods of Operations Research >Solutions of the average cost optimality equation for finite Markov decision chains: risk-sensitive and risk-neutral criteria
【24h】

Solutions of the average cost optimality equation for finite Markov decision chains: risk-sensitive and risk-neutral criteria

机译:有限马尔可夫决策链的平均成本最优方程的解:风险敏感和风险中立的准则

获取原文
获取原文并翻译 | 示例

摘要

This work is concerned with controlled Markov chains with finite state and action spaces. It is assumed that the decision maker has an arbitrary but constant risk sensitivity coefficient, and that the performance of a control policy is measured by the long-run average cost criterion. Within this framework, the existence of solutions of the corresponding risk-sensitive optimality equation for arbitrary cost function is characterized in terms of communication properties of the transition law.
机译:这项工作涉及具有有限状态和动作空间的受控马尔可夫链。假定决策者具有任意但恒定的风险敏感度系数,并且控制策略的性能由长期平均成本准则衡量。在此框架内,根据转移定律的通信性质,对任意成本函数的相应风险敏感最优方程的解的存在性进行了描述。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号