A Unification of Extensive-Form Games and Markov Decision Processes

机译：广义形式博弈与马尔可夫决策过程的统一

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

We describe a generalization of extensive-form games that greatly increases representational power while still allowing efficient computation in the zero-sum setting. A principal feature of our generalization is that it places arbitrary convex optimization problems at decision nodes, in place of the finite action sets typically considered. The possibly-infinite action sets mean we must "forget" the exact action taken (feasible solution to the optimization problem), remembering instead only some statistic sufficient for playing the rest of the game optimally. Our new model provides an exponentially smaller representation for some games; in particular, we show how to compactly represent (and solve) extensive-form games with outcome uncertainty and a generalization of Markov decision processes to multi-stage adversarial planning games.

机译：我们描述了广义形式博弈的一般化，该博弈极大地提高了表示能力，同时仍然允许零和设置下的有效计算。我们一般化的主要特征是它在决策节点处放置了任意凸优化问题，以代替通常考虑的有限动作集。可能无限的动作集意味着我们必须“忘记”所采取的确切动作（对优化问题的可行解决方案），而只能记住一些足以使游戏其余部分达到最佳状态的统计信息。我们的新模型为某些游戏提供了指数级较小的表示形式；特别是，我们展示了如何紧凑地表示（和解决）具有不确定结果的广泛形式的博弈，以及将马尔可夫决策过程推广到多阶段对抗计划博弈的过程。

著录项

来源
《AAAI Conference on Artificial Intelligence(AAAI-07); Innovative Applications of Artificial Intelligence Conference(IAAI-07); 20070722-26; 20070722-26; Vancouver(CA); Vancouver(CA)》|2007年|P.86-93|共8页
会议地点 Vancouver(CA);Vancouver(CA)
作者
H. Brendan McMahan; Geoffrey J. Gordon;
展开▼
作者单位

School of Computer Science Carnegie Mellon University Pittsburgh. PA 15213;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类人工智能理论;
关键词

相似文献

外文文献
中文文献
专利

1. The complexity of analyzing infinite-state Markov chains, Markov decision processes, and stochastic games (Invited talk) [J] . Kousha Etessami LIPIcs : Leibniz International Proceedings in Informatics . 2013,第1期

机译：分析无限状态马尔可夫链，马尔可夫决策过程和随机博弈的复杂性（特邀演讲）
2. Method for Constructing Artificial Intelligence Player With Abstractions to Markov Decision Processes in Multiplayer Game of Mahjong [J] . Kurita Moyuru, Hoki Kunihito IEEE Transactions on Games . 2021,第1期

机译：用抽象构建人工智能员的方法，以Mahjong多人游戏中的马尔可夫决策过程
3. Reducible Markov Decision Processes and Stochastic Games [J] . Ning Jie Production and operations management . 2021,第8期

机译：还原马尔可夫决策过程和随机游戏
4. A Unification of Extensive-Form Games and Markov Decision Processes [C] . H. Brendan McMahan, Geoffrey J. Gordon AAAI Conference on Artificial Intelligence . 2007

机译：广泛形式的游戏和马尔可夫决策过程的统一
5. Modern Methods of Hidden Markov Models and Partially Observable Markov Decision Processes in Biostatistics [D] . Xu, Zekun. 2020

机译：隐藏马尔可夫模型的现代方法和止痛性的部分可观察马尔可夫决策过程
6. Decision Making Under Uncertainty: A Neural Model Based on Partially Observable Markov Decision Processes [O] . Rajesh P. N. Rao 2010

机译：不确定性下的决策：基于部分可观察的马尔可夫决策过程的神经模型
7. Online Convex Optimization for Sequential Decision Processes and Extensive-Form Games [O] . Gabriele Farina, Christian Kroer, Tuomas Sandholm 2019

机译：用于顺序决策过程和广泛形式游戏的在线凸优化
8. Two Short Notes on Markov Processes: I. A Test for Sub-Optimal Actions in Markovian Decision Problems. II. An Intrinsically Determined Markov Chain [R] . MacQueen, J. B. 1966

机译：关于马尔可夫过程的两个简短说明：I。马尔可夫决策问题中次优最优行动的检验。 II。本质上确定的马尔可夫链

A Unification of Extensive-Form Games and Markov Decision Processes

摘要

著录项

相似文献

相关主题

期刊订阅