...
首页> 外文期刊>European Journal of Control >Performance Optimization Of Semi-markov Decision Processes With Discounted-cost Criteria
【24h】

Performance Optimization Of Semi-markov Decision Processes With Discounted-cost Criteria

机译:具有折扣成本准则的半马尔可夫决策过程的性能优化

获取原文
获取原文并翻译 | 示例
           

摘要

We discuss the problems of discounted-cost performance optimization for a class of semi-Markov decision processes (SMDPs). We define a matrix which can be used as the infinitesimal generator of a Markov process. The discounted Poisson equation is proposed for an SMDP by using this matrix, from which the α-potential is defined. The optimality equation satisfied by the optimal stationary policy is given and the relation between discounted model and average model is discussed. Two iteration algorithms to find ∈-optimal policies are proposed and the proofs of convergence of these two algorithms are given. A numerical example is provided to illustrate the application of the algorithms.
机译:我们讨论了一类半马尔可夫决策过程(SMDP)的折扣成本性能优化问题。我们定义一个矩阵,该矩阵可用作马尔可夫过程的无穷小生成器。通过使用该矩阵为SMDP提出了折扣泊松方程,从中定义了α势。给出了最优平稳策略满足的最优方程,讨论了折现模型与平均模型之间的关系。提出了两种寻找ε最优策略的迭代算法,并给出了这两种算法的收敛性证明。提供了一个数值示例来说明算法的应用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号