首页> 外文期刊>Dynamic games and applications >On Canonical Forms for Zero-Sum Stochastic Mean Payoff Games
【24h】

On Canonical Forms for Zero-Sum Stochastic Mean Payoff Games

机译:零和随机均值支付游戏的规范形式

获取原文
获取原文并翻译 | 示例
           

摘要

We consider two-person zero-sum mean payoff undiscounted stochastic games and obtain sufficient conditions for the existence of a saddle point in uniformly optimal stationary strategies. Namely, these conditions enable us to bring the game, by applying potential transformations, to a canonical form in which locally optimal strategies are globally optimal, and hence the value for every initial position and the optimal strategies of both players can be obtained by playing the local game at each state. We show that these conditions hold for the class of additive transition (AT) games, that is, the special case when the transitions at each state can be decomposed into two parts, each controlled completely by one of the two players. An important special case of AT-games form the so-called BWR-games which are played by two players on a directed graph with positions of three types: Black, White and Random. We give an independent proof for the existence of a canonical form in such games, and use this result to derive the existence of a canonical form (and hence, of a saddle point in uniformly optimal stationary strategies) in a wide class of games, which includes stochastic games with perfect information (Pl), switching controller (SC) games and additive rewards, additive transition (ARAT) games. Unlike the proof for AT-games, our proof for the BWR-case does not rely on the existence of a saddle point in stationary strategies. We also derive some algorithmic consequences from these our reductions to BWR-games, in terms of solving PI-, and ARAT-games in sub-exponential time.
机译:我们考虑了两人零和平均收益无折扣的随机博弈,并为一致最优平稳策略中的鞍点的存在获得了充分的条件。也就是说,这些条件使我们能够通过应用潜在的转换,将游戏带入一种规范形式,在这种形式中局部最优策略是全局最优的,因此,通过玩游戏,可以获得每个初始位置的价值和两个玩家的最优策略。每个州的本地游戏。我们证明了这些条件适用于加性过渡(AT)游戏类别,即特殊情况,即每个状态的过渡可以分解为两个部分,每个部分完全由两个参与者之一控制。 AT游戏的一个重要特例是所谓的BWR游戏,由两个玩家在有向图上以三种类型的位置进行游戏:黑,白和随机。我们给出了此类游戏中规范形式存在的独立证明,并使用此结果来推导广泛类游戏中规范形式的存在(因此,在统一最优固定策略中存在鞍点)。包括具有完善信息(Pl)的随机游戏,切换控制器(SC)游戏和附加奖励,附加过渡(ARAT)游戏。与用于AT游戏的证明不同,我们针对BWR情况的证明不依赖平稳策略中鞍点的存在。我们还通过在亚指数时间内解决PI和ARAT游戏,从减少BWR游戏中得出了一些算法上的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号