Approximating maxmin strategies in imperfect recall games using A-loss recall property

Jiří Čermák; Branislav Bošanský; Karel Horák; Viliam Lisý; Michal Pěchouček


Abstract

Extensive-form games with imperfect recall are an important model of dynamic games where the players are allowed to forget previously known information. Often, imperfect recall games result from an abstraction algorithm that simplifies a large game with perfect recall. Solving imperfect recall games is known to be a hard problem, and thus it is useful to search for a subclass of imperfect recall games which offers sufficient memory savings while being efficiently solvable. The abstraction process can then be guided to result in a game from this class. We focus on a subclass of imperfect recall games called A-loss recall games. First, we provide a complete picture of the complexity of solving imperfect recall and A-loss recall games. We show that the A-loss recall property allows us to compute a best response in polynomial time (computing a best response is NP-hard in imperfect recall games). This allows us to create a practical algorithm for approximating maxmin strategies in two-player games where the maximizing player has imperfect recall and the minimizing player has A-loss recall. This algorithm is capable of solving some games with up to $5 \cdot 10^9$ states in approximately 1 hour. Finally, we demonstrate that the use of imperfect recall abstraction can reduce the size of the strategy representation to as low as 0.03% of the size of the strategy representation in the original perfect recall game without sacrificing the quality of the maxmin strategy obtained by solving this abstraction.
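
For readers unfamiliar with the term, the maxmin strategy mentioned in the abstract can be stated as below. This is the standard textbook definition, not notation taken from the paper: the symbols $\Sigma_1$ and $\Sigma_2$ (strategy sets of the maximizing and minimizing player) and $u_1$ (utility of the maximizing player) are assumptions introduced here for illustration.

```latex
% Assumed notation: \Sigma_1, \Sigma_2 are the players' strategy sets,
% u_1 is the utility of the maximizing player (player 1).
\sigma_1^{*} \in \arg\max_{\sigma_1 \in \Sigma_1} \; \min_{\sigma_2 \in \Sigma_2} u_1(\sigma_1, \sigma_2)
```

Under this reading, the inner minimization is a best-response computation for the minimizing player, which appears to be the step the abstract describes as polynomial-time when that player has A-loss recall.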


Bibliographic details

Source: 《高分子論文集》, 2018, No. 2, pp. 290-326 (37 pages)
Authors: Jiří Čermák; Branislav Bošanský; Karel Horák; Viliam Lisý; Michal Pěchouček
Affiliations: Department of Computer Science, Czech Technical University in Prague (listed for each of the five authors)
Indexed in: Science Citation Index (SCI), USA
Original format: PDF
Language: English
Keywords: Imperfect recall; Abstraction; Maxmin strategy; A-loss recall

Similar literature

1. Automated construction of bounded-loss imperfect-recall abstractions in extensive-form games [J]. Jiri Cermak, Viliam Lisy, Branislav Bosansky. Artificial Intelligence. 2020, May issue.
2. On equilibria in games with imperfect recall [J]. Lambert Nicolas S., Marple Adrian, Shoham Yoav. Games and Economic Behavior. 2019.
3. Application of the Eisert-Wilkens-Lewenstein quantum game scheme to decision problems with imperfect recall [J]. Frąckiewicz P. Journal of Physics A: Mathematical and Theoretical. 2011, No. 32.
4. Combining Incremental Strategy Generation and Branch and Bound Search for Computing Maxmin Strategies in Imperfect Recall Games [C]. Jiri Cermak, Branislav Bosansky, Michal Pechoucek. International Conference on Autonomous Agents and Multiagent Systems. 2018.
5. Three essays in microeconomics: Uncertain innovation in the presence of network externalities. Copyleft: An R&D game with network externalities. Imperfect recall in a model of search. [D]. Subramanian, Prita. 2000.
6. Recall and recognition of in-game advertising: the role of game control [O]. Laura Herrewijn, Karolien Poels. 2013.
7. Computing Maxmin Strategies in Extensive-Form Zero-Sum Games with Imperfect Recall [O]. Bosansky, Branislav, Cermak, Jiri, Horak, Karel. 2017.
8. On MaxMin and MinMax Strategies in Multi-Stage Games and ATACM. [R]. Anderson, L. B., Bracken, J., Falk, J. E. 1976.
