首页> 美国政府科技报告 >Partially Observable Markov Decision Processes with an Average Cost Criterion.
【24h】

Partially Observable Markov Decision Processes with an Average Cost Criterion.

机译:具有平均成本准则的部分可观察马尔可夫决策过程。

获取原文

摘要

We consider partially observable Markov decision processes with finite or countable (core) state and observation spaces and finite control space. Following a standard approach, an equivalent completely observed problem is formulated, with the same finite control space but with an uncountable state space, namely the space of probability distributions on the original core state space. It is observed that some characteristics induced in the original problem due to the finiteness, or countability, of the spaces involved are reflected onto the equivalent problem. Sufficient conditions are then derived for a bounded solution to the average cost optimality equation to exist. We illustrate these results in the context of machine replacement problems. Structural properties for average cost policies are obtained for a two state replacement problem, similar to available results for discount optimal policies. The set of assumptions used seems to be significantly less restrictive than others currently available. Keywords: Reprints. (kr)

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号