Venue: International Conference on Autonomous Agents and Multiagent Systems

Just Add Pepper: Extending Learning Algorithms for Repeated Matrix Games to Repeated Markov Games



Abstract

Learning in multi-agent settings has recently garnered much interest, the result of which has been the development of somewhat effective multi-agent learning (MAL) algorithms for repeated normal-form games. However, general-purpose MAL algorithms for richer environments, such as general-sum repeated stochastic (Markov) games (RSGs), are less advanced. Indeed, previously created MAL algorithms for RSGs are typically successful only when the behavior of associates meets specific game-theoretic assumptions and when the game is of a particular class (such as zero-sum games). In this paper, we present a new algorithm, called Pepper, that can be used to extend MAL algorithms designed for repeated normal-form games to RSGs. We demonstrate that Pepper creates a family of new algorithms, each of whose asymptotic performance in RSGs is reminiscent of its asymptotic performance in related repeated normal-form games. We also show that some algorithms formed with Pepper outperform existing algorithms in an interesting RSG.
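The abstract does not spell out Pepper's construction, but the general idea it gestures at, lifting a learner built for a single repeated matrix game so that it plays a Markov game, can be sketched generically: run one matrix-game learner per state and feed it backed-up (reward plus discounted future value) payoffs instead of one-shot rewards. The sketch below is an illustrative assumption, not the Pepper algorithm from the paper; the class names (`MatrixGameLearner`, `LiftedMarkovGameLearner`), the fictitious-play-style learner, and the value backup are all hypothetical choices made for this example.

```python
from collections import defaultdict


class MatrixGameLearner:
    """Toy fictitious-play-style learner for one repeated matrix game (assumed, not from the paper)."""

    def __init__(self, n_actions):
        self.n_actions = n_actions
        self.opp_counts = [1] * n_actions  # Laplace-smoothed opponent action counts
        # Optimistic initial payoff estimates encourage trying each joint action.
        self.payoff = [[1.0] * n_actions for _ in range(n_actions)]
        self.visits = [[0] * n_actions for _ in range(n_actions)]

    def _expected(self, a):
        # Expected payoff of action a against the empirical opponent distribution.
        total = sum(self.opp_counts)
        return sum(self.payoff[a][o] * self.opp_counts[o] / total
                   for o in range(self.n_actions))

    def act(self):
        # Best response to the empirical opponent action distribution.
        return max(range(self.n_actions), key=self._expected)

    def best_expected(self):
        return max(self._expected(a) for a in range(self.n_actions))

    def update(self, my_action, opp_action, payoff_sample):
        self.opp_counts[opp_action] += 1
        self.visits[my_action][opp_action] += 1
        n = self.visits[my_action][opp_action]
        # Running average of observed payoffs for this joint action.
        self.payoff[my_action][opp_action] += (
            payoff_sample - self.payoff[my_action][opp_action]) / n


class LiftedMarkovGameLearner:
    """One matrix-game learner per state; each sees Q-style backed-up payoffs."""

    def __init__(self, n_actions, gamma=0.9):
        self.gamma = gamma
        self.learners = defaultdict(lambda: MatrixGameLearner(n_actions))
        self.value = defaultdict(float)  # crude per-state value estimates

    def act(self, state):
        return self.learners[state].act()

    def update(self, state, my_action, opp_action, reward, next_state):
        # The per-state "matrix game" is played over reward + discounted future
        # value, so each learner optimizes long-run rather than one-shot payoff.
        target = reward + self.gamma * self.value[next_state]
        self.learners[state].update(my_action, opp_action, target)
        self.value[state] = self.learners[state].best_expected()


# Tiny demo: two states, opponent always plays action 1, matching it pays 1.
agent = LiftedMarkovGameLearner(n_actions=2)
state = "A"
for _ in range(100):
    a = agent.act(state)
    reward = 1.0 if a == 1 else 0.0
    next_state = "B" if state == "A" else "A"
    agent.update(state, a, 1, reward, next_state)
    state = next_state
print(agent.act("A"), agent.act("B"))  # the learner settles on matching action 1
```

The key design point this sketch illustrates is the separation of concerns: the matrix-game learner never knows it is inside a Markov game, because the backup `reward + gamma * value[next_state]` converts the sequential problem into a family of per-state matrix games over long-run payoffs.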


