...
首页> 外文期刊>Journal of Mathematical Analysis and Applications >Optimal policy for minimizing risk models in Markov decision processes
【24h】

Optimal policy for minimizing risk models in Markov decision processes

机译:最小化马尔可夫决策过程中风险模型的最佳策略

获取原文
获取原文并翻译 | 示例

摘要

We consider the minimizing risk problems in discounted Markov decisions processes with countable state space and bounded general rewards. We characterize optimal values for finite and infinite horizon cases and give two sufficient conditions for the existence of an optimal policy in an infinite horizon case. These conditions are closely connected with Lemma 3 in White (1993), which is not correct as Wu and Lin (1999) point out We obtain a condition for the lemma to be true, under which we show that there is an optimal policy. Under another condition we show that an optimal value is a unique solution to some optimality equation and there is an optimal policy on a transient set. (C) 2002 Elsevier Science (USA). All rights reserved. [References: 14]
机译:我们考虑在具有可数状态空间和有限一般报酬的折现马尔可夫决策过程中将风险问题最小化。我们描述了有限和无限情况下的最优值,并为存在无限条件下的最优策略给出了两个充分条件。这些条件与怀特(1993)中的引理3紧密相关,正如吴和林(1999)指出的那样,这是不正确的。我们获得了一个引理为真的条件,在此条件下,我们表明存在最优策略。在另一个条件下,我们表明最优值是某些最优性方程的唯一解,并且在暂态集上存在最优策略。 (C)2002 Elsevier Science(美国)。版权所有。 [参考:14]

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号