
Certainty equivalence for imperfect information finite state-space stochastic games

Abstract

Stochastic games under imperfect information are typically computationally intractable even in the discrete-time/discrete-state case considered here. We consider a problem where one player has perfect information. A function of a conditional probability distribution is proposed as an information state. In the problem form here, the payoff is only a function of the terminal state of the system, and the initial information state is either linear or a sum of max-plus delta functions. When the initial information state belongs to these classes, its propagation is finite-dimensional. The state feedback value function is also finite-dimensional, and obtained via dynamic programming, but has a nonstandard form due to the necessity of an expanded state variable. Under a saddle point assumption, certainty equivalence is obtained and the proposed function is indeed an information state.
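
As a rough illustration of the dynamic-programming step mentioned in the abstract, the following minimal sketch runs a backward recursion for a finite-state, finite-horizon zero-sum game whose payoff depends only on the terminal state. It uses the standard state-feedback recursion, not the expanded-state form the abstract refers to. The function name, the transition array layout `P[u][w][x][y]`, the restriction to pure controls, and the choice of which player maximizes are all assumptions made for this example. The recursion computes the lower (max-min) value, which coincides with the upper value when a saddle point exists.

```python
def terminal_payoff_dp(P, phi, horizon):
    """Backward recursion for the lower (max-min) value of a finite game.

    P[u][w][x][y] is the assumed probability of moving from state x to
    state y when the maximizer plays u and the minimizer plays w.
    phi[x] is the payoff if the game ends in state x.
    """
    n = len(phi)
    V = list(phi)  # value at the terminal time
    for _ in range(horizon):
        V_next = V
        V = []
        for x in range(n):
            # Maximizer picks u; minimizer answers with the worst w.
            V.append(max(
                min(
                    sum(P[u][w][x][y] * V_next[y] for y in range(n))
                    for w in range(len(P[u]))
                )
                for u in range(len(P))
            ))
    return V


# Hypothetical two-state example (all numbers are illustrative only).
stay = [[1.0, 0.0], [0.0, 1.0]]        # remain in the current state
uniform = [[0.5, 0.5], [0.5, 0.5]]     # jump to either state equally
biased = [[0.75, 0.25], [0.75, 0.25]]  # jump mostly to state 0
P = [[stay, stay], [uniform, biased]]  # indexed as P[u][w]
phi = [0.0, 1.0]                       # terminal payoff per state

values = terminal_payoff_dp(P, phi, horizon=2)
print(values)
```

Each backward step costs on the order of (number of states)^2 times the number of control pairs, which is what makes the perfect-information recursion tractable; the imperfect-information problem in the abstract additionally requires propagating an information state.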