...
首页> 外文期刊>Mathematical methods of operations research >Controlled Markov chains with risk-sensitive criteria: Average cost, optimality equations, and optimal solutions
【24h】

Controlled Markov chains with risk-sensitive criteria: Average cost, optimality equations, and optimal solutions

机译:具有风险敏感标准的受控马尔可夫链:平均成本,最优方程和最优解

获取原文
获取原文并翻译 | 示例

摘要

We study controlled Markov chains with denumerable state space and bounded costs per stage. A (long-rum) risk-sensitive average cost criterion, associated to an exponential utility function with a constant risk sensitivity coefficient, is used as a performance measure. The main assumption on the probabilistic structure of the model is that the transition law satisfies a simultaneous Doeblin condition. Working within this framework, the main results obtained can be summarized as follows: If the constant risk-sensitivity coefficient is small enough, then an associated optimality equation has a bounded solution with a constant value for the optimal risk-sensitive average cost; in addition, under further standard continuity-compactness as-sumptions, optimal stationary policies are obtained. However, it is also shown that the above conclusions fail to hold, in general, for large enough values of the risk-sensitivity coefficient. Our results therefore disprove previous claims on this topic. Also of importance is the fact that our developments are very much self-contained and employ only basic probabilistic and analysis principles.
机译:我们研究了具有无穷状态空间和每个阶段的受限成本的受控马尔可夫链。与具有恒定风险敏感性系数的指数效用函数相关的(长朗姆酒)风险敏感性平均成本准则用作绩效指标。该模型的概率结构的主要假设是过渡定律同时满足Doeblin条件。在此框架内工作,可以得出以下主要结果:如果恒定的风险敏感度系数足够小,则关联的最优方程具有最佳风险敏感平均成本的恒定值的有界解;另外,在进一步的标准连续性—紧凑性假设下,获得了最优的平稳策略。但是,还表明,对于风险敏感性系数足够大的值,上述结论通常无法成立。因此,我们的结果证明了先前对此主题的主张。同样重要的是,我们的发展非常独立,只采用基本的概率和分析原理。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号