首页> 外文会议>IEEE Conference on Decision and Control >Singularly perturbed Markov chains. II. Applications to controlled dynamic systems and Markov decision processes
【24h】

Singularly perturbed Markov chains. II. Applications to controlled dynamic systems and Markov decision processes

机译:奇异于扰动的马尔可夫链。 II。应用于控制动态系统和马尔可夫决策过程

获取原文

摘要

For pt.I see ibid., p.1103-8 (1997). Markov decision process with weak and strong interactions, such that the states of the process can be divided into several groups such that transitions among the states within each group occur much more frequently than the transitions among the states belonging to different groups, are considered. By treating the states in each group as a single state, one derives a limit problem for discounted cost criteria. Given an optimal solution of the limit problem, one constructs a solution for the original problem which is asymptotically optimal. The paper discusses applications to Markov decision processes and nearly optimal controls of stochastic dynamic systems. Proofs are omitted.
机译:对于pt.i查看同上。,p.1103-8(1997)。马尔可夫决策过程具有薄弱和强烈的相互作用,使得该过程的状态可以分为几个组,使得每组内的各州的转换比属于不同组的国家之间的转换更频繁地发生得多。通过将各组的各国视为单一国家,获得贴现成本标准的限制问题。鉴于极限问题的最佳解决方案,一个构建了对渐近最佳的原始问题的解决方案。本文讨论了Markov决策过程的应用以及随机动态系统的几乎最优控制。省略证明。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号