...
首页> 外文期刊>Stochastics: An International Journal of Probability and Stochastic Processes >Successive approximations in partially observable controlled Markov chains with risk-sensitive average criterion
【24h】

Successive approximations in partially observable controlled Markov chains with risk-sensitive average criterion

机译:具有风险敏感平均准则的部分可观察的受控马尔可夫链的逐次逼近

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Partially observable Markov decision chains with finite state, action and signal spaces are considered. The performance index is the risk-sensitive average criterion and, under conditions concerning reachability between the unobservable states and observability of the signals, it is shown that the value iteration algorithm can be implemented to approximate the optimal average cost, to determine a stationary policy whose performance index is arbitrarily close to the optimal one, and to establish the existence of solutions to the optimality equation. The results rely on an appropriate extension of the well-known Schweitzer's transformation.
机译:考虑具有有限状态,动作和信号空间的部分可观察的马尔可夫决策链。性能指标是风险敏感的平均标准,并且在涉及不可观察状态之间的可达性和信号的可观察性的条件下,表明可以使用值迭代算法来逼近最佳平均成本,从而确定其性能指标可任意接近最优指标,并建立最优方程解的存在性。结果依赖于著名的Schweitzer变换的适当扩展。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号