...
首页> 外文期刊>高分子論文集 >Approximating maxmin strategies in imperfect recall games using A-loss recall property
【24h】

Approximating maxmin strategies in imperfect recall games using A-loss recall property

机译:使用A损失召回属性在不完全召回游戏中近似maxmin策略

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

AbstractExtensive-form games with imperfect recall are an important model of dynamic games where the players are allowed to forget previously known information. Often, imperfect recall games result from an abstraction algorithm that simplifies a large game with perfect recall. Solving imperfect recall games is known to be a hard problem, and thus it is useful to search for a subclass of imperfect recall games which offers sufficient memory savings while being efficiently solvable. The abstraction process can then be guided to result in a game from this class. We focus on a subclass of imperfect recall games called A-loss recall games. First, we provide a complete picture of the complexity of solving imperfect recall and A-loss recall games. We show that the A-loss recall property allows us to compute a best response in polynomial time (computing a best response isNP-hard in imperfect recall games). This allows us to create a practical algorithm for approximating maxmin strategies in two-player games where the maximizing player has imperfect recall and the minimizing player has A-loss recall. This algorithm is capable of solving some games with up to5109states in approximately 1 hour. Finally, we demonstrate that the use of imperfect recall abstraction can reduce the size of the strategy representation to as low as0.03%of the size of the strategy representation in the original perfect recall game without sacrificing the quality of the maxmin strategy obtained by solving this abstraction.
机译: 摘要 召回方式不完善的广泛形式的游戏是动态游戏的重要模型,允许玩家忘记以前知道的信息。通常,不完善的召回游戏是由抽象算法导致的,该算法简化了具有完美召回功能的大型游戏。解决不完善的召回游戏众所周知是一个难题,因此搜索不完善的召回游戏的子类非常有用,该子类可以节省足够的内存,同时又可以有效解决。然后,可以指导抽象过程以产生此类的游戏。我们专注于不完善的召回游戏的子类,称为A-损失召回游戏。首先,我们提供了解决不完善召回和A损失召回游戏的复杂性的完整图片。我们证明了A损失回想属性使我们能够在多项式时间内计算出最佳响应(计算最佳响应为 NP -在不完善的召回游戏中很难) 。这使我们能够创建一种实用的算法来近似两人游戏中的最大化策略,其中最大化的玩家召回率不理想,而最小的玩家则追回A损失。此算法最多可以解决某些游戏,并且游戏的 5 10 9 状态大约需要1个小时。最后,我们证明了使用不完善的召回抽象可以将策略表示的大小减小到 0.03 的大小原始完美召回游戏中的策略表示,而不牺牲通过解决这种抽象而获得的maxmin策略的质量。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号