【24h】

A Unification of Extensive-Form Games and Markov Decision Processes

机译:广义形式博弈与马尔可夫决策过程的统一

获取原文
获取原文并翻译 | 示例

摘要

We describe a generalization of extensive-form games that greatly increases representational power while still allowing efficient computation in the zero-sum setting. A principal feature of our generalization is that it places arbitrary convex optimization problems at decision nodes, in place of the finite action sets typically considered. The possibly-infinite action sets mean we must "forget" the exact action taken (feasible solution to the optimization problem), remembering instead only some statistic sufficient for playing the rest of the game optimally. Our new model provides an exponentially smaller representation for some games; in particular, we show how to compactly represent (and solve) extensive-form games with outcome uncertainty and a generalization of Markov decision processes to multi-stage adversarial planning games.
机译:我们描述了广义形式博弈的一般化,该博弈极大地提高了表示能力,同时仍然允许零和设置下的有效计算。我们一般化的主要特征是它在决策节点处放置了任意凸优化问题,以代替通常考虑的有限动作集。可能无限的动作集意味着我们必须“忘记”所采取的确切动作(对优化问题的可行解决方案),而只能记住一些足以使游戏其余部分达到最佳状态的统计信息。我们的新模型为某些游戏提供了指数级较小的表示形式;特别是,我们展示了如何紧凑地表示(和解决)具有不确定结果的广泛形式的博弈,以及将马尔可夫决策过程推广到多阶段对抗计划博弈的过程。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号