首页> 外文期刊>Journal of Mathematical Analysis and Applications >NEW CLASS OF POLICIES IN VECTOR-VALUED MARKOV DECISION PROCESSES
【24h】

NEW CLASS OF POLICIES IN VECTOR-VALUED MARKOV DECISION PROCESSES

机译:价值向量马尔可夫决策过程中的新一类策略

获取原文
获取原文并翻译 | 示例
           

摘要

For a vector-valued Markov decision process with discounted reward criterion, we introduce a new class of policies called the semi-stationary policies and show that an optimal semi-stationary policy that attains the extreme points of the set of rewards induced by all policies can be described as a combination of optimal stationary policies. (C) 1996 Academic Press, Inc. [References: 5]
机译:对于具有折现奖励标准的向量值Markov决策过程,我们引入了一种称为半平稳策略的新策略,并证明了达到所有策略所引发的奖励集的极点的最优半平稳策略可以被描述为最佳固定政策的组合。 (C)1996 Academic Press,Inc. [参考:5]

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号