首页> 外文学位 >Average optimality in infinite horizon optimization.
【24h】

Average optimality in infinite horizon optimization.

机译:无限期优化中的平均最优。

获取原文
获取原文并翻译 | 示例

摘要

We study three classes of infinite horizon optimization problems: the undiscounted homogeneous Markov decision process, the undiscounted nonhomogeneous Markov decision process, and the undiscounted deterministic problem.;To solve the undiscounted homogeneous Markov decision process, we give a sufficient condition for the existence of a stationary optimal strategy using the Doeblin coefficient. We also compare this condition with others in the literature. This condition allows transformation of the original problem into an equivalent discounted homogeneous Markov decision process for which a solution method is known.;For the undiscounted nonhomogeneous Markov decision process, it is known that an algorithmically optimal strategy is average optimal. Based on it, we present two solution methodologies using transformations into equivalent problems. Both procedures first transform the original problem into an equivalent discounted nonhomogeneous Markov decision process using the Doeblin coefficient. Then, the first (second) procedure transforms it into an equivalent discounted deterministic problem (discounted homogeneous Markov decision process) whose solution method is known.;For undiscounted deterministic problems, it is not known whether an algorithmically optimal strategy is average optimal or not. We present sufficient conditions for it, and apply the result to a production planning problem and a Markov decision process.
机译:我们研究了三类无限视野优化问题:无折扣齐次马尔可夫决策过程,无折扣非齐次马尔可夫决策过程和无折扣确定性问题;;为解决无折扣齐次马尔可夫决策过程,我们为存在无条件齐次马尔可夫决策过程提供了充分条件。使用Doeblin系数的平稳最优策略。我们还将这种情况与文献中的其他情况进行比较。该条件允许将原始问题转换为已知的求解方法的等效折现齐次马尔可夫决策过程。对于非折扣非齐次马尔可夫决策过程,已知算法最优策略是平均最优的。基于此,我们提出了两种使用转化为等效问题的解决方法。这两个过程都首先使用Doeblin系数将原始问题转换为等效的折现非齐次Markov决策过程。然后,第一个(第二个)过程将其转换为已知的折现确定性问题(折扣齐次马尔可夫决策过程),其求解方法是已知的;对于非折扣确定性问题,未知算法最优策略是否为平均最优。我们为此提供了充分的条件,并将结果应用于生产计划问题和Markov决策过程。

著录项

  • 作者

    Park, Yunsun.;

  • 作者单位

    University of Michigan.;

  • 授予单位 University of Michigan.;
  • 学科 Engineering Industrial.
  • 学位 Ph.D.
  • 年度 1990
  • 页码 127 p.
  • 总页数 127
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号