首页> 外文会议> >Singularly perturbed Markov chains. II. Applications to controlled dynamic systems and Markov decision processes
【24h】

Singularly perturbed Markov chains. II. Applications to controlled dynamic systems and Markov decision processes

机译:奇摄动的马尔可夫链。二。在受控动态系统和马尔可夫决策过程中的应用

获取原文

摘要

For pt.I see ibid., p.1103-8 (1997). Markov decision process with weak and strong interactions, such that the states of the process can be divided into several groups such that transitions among the states within each group occur much more frequently than the transitions among the states belonging to different groups, are considered. By treating the states in each group as a single state, one derives a limit problem for discounted cost criteria. Given an optimal solution of the limit problem, one constructs a solution for the original problem which is asymptotically optimal. The paper discusses applications to Markov decision processes and nearly optimal controls of stochastic dynamic systems. Proofs are omitted.
机译:关于pt,我见同上,p.1103-8(1997)。具有弱交互作用和强交互作用的马尔可夫决策过程,因此可以将过程的状态分为几组,以使每组内的状态之间的转换比属于不同组的状态之间的转换更频繁地发生。通过将每个组中的状态视为一个状态,可以得出折价成本标准的极限问题。给定极限问题的最优解,就可以构造渐近最优的原始问题的解。本文讨论了马尔可夫决策过程和随机动态系统的近乎最优控制的应用。证明被省略。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号