首页> 外文期刊>Mathematical methods of operations research >A decomposition approach for undiscounted two-person zero-sum stochastic games
【24h】

A decomposition approach for undiscounted two-person zero-sum stochastic games

机译:无折扣两人零和随机游戏的分解方法

获取原文
获取原文并翻译 | 示例
           

摘要

Two-person zero-sum stochastic games are considered under the long-run average expected payoff criterion. State and action spaces are assumed finite. By making use of the concept of maximal communicating classes, the following decomposition algorithm is introduced for solving two-person zero-sum stochastic games: First the state space is decomposed into maximal communicating classes. Then, these classes are organized in an hierarchical order where each level may contain more than one maximal communicating class. Best stationary strategies for the states in a maximal communicating class at a level are determined by using the best stationary strategies of the states in the previous levels that are accessible from that class. At the initial level, a restricted games is defined for each closed maximal communicating class and these restricted games are solved independently. It is shown that the proposed decomposition algorithm is exact in the sense that the solution obtained from the decomposition procedure gives the best stationary strategies for the original stochastic game.
机译:在长期平均预期收益标准下,考虑了两人零和随机游戏。状态和动作空间被认为是有限的。利用最大通信类的概念,引入以下分解算法求解两人零和随机游戏:首先将状态空间分解为最大通信类。然后,这些类按层次结构顺序组织,其中每个级别可以包含一个以上的最大通信类。通过使用从该类别可访问的先前级别中的状态的最佳静态策略,确定该级别的最大通信类中的状态的最佳静态策略。在初始级别,为每个封闭的最大通信类定义一个受限游戏,并且这些受限游戏是独立解决的。结果表明,从分解过程中获得的解给出了原始随机博弈的最佳平稳策略的意义上,所提出的分解算法是准确的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号