首页> 外文会议>International conference on autonomous agents and multiagent systems;AAMAS 2011 >Sequential targeted optimality as a new criterion for teaching and following in repeated games

【24h】

Sequential targeted optimality as a new criterion for teaching and following in repeated games

机译：顺序目标最优作为重复游戏中教学和跟随的新标准

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In infinitely repeated games, the act of teaching an outcome to our adversaries can be beneficial to reach coordination, as well as allowing us to 'steer' adversaries to outcomes that are more beneficial to us. Teaching works well against followers, agents that are willing to go along with the proposal, but can lead to miscoordination otherwise. In the context of infinitely repeated games there is, as of yet, no clear formalism that tries to capture and combine these behaviours into a unified view in order to reach a solution of a game. In this paper, we propose such a formalism in the form of an algorithmic criterion, which uses the concept of targeted learning. As we will argue, this criterion can be a beneficial criterion to adopt in order to reach coordination. Afterwards we propose an algorithm that adheres to our criterion that is able to teach pure strategy Nash Equilibria to a broad class of opponents in a broad class of games and is able to follow otherwise, as well as able to perform well in self-play.

机译：在无限次重复的游戏中，向我们的对手传授结果的行为可能有益于达成协调，并允许我们将对手“引导”到对我们更有利的结果上。教学对于愿意跟进该建议的追随者，代理商非常有效，但否则会导致协调不善。迄今为止，在无限重复的游戏中，还没有明确的形式主义试图将这些行为捕捉并结合到一个统一的视图中，以寻求游戏的解决方案。在本文中，我们以算法准则的形式提出了这种形式主义，它使用了目标学习的概念。就像我们将要争论的那样，该标准可能是为了达成协调而采用的有益标准。之后，我们提出了一种符合我们标准的算法，该算法能够向各种游戏中的众多对手教授纯净策略纳什均衡，并且能够遵循其他准则，并且能够在自打中表现出色。

著录项

来源
《International conference on autonomous agents and multiagent systems;AAMAS 2011 》|2011年|p.481-488|共8页
会议地点
作者
Max Knobbout; Gerard A.W. Vreeswijk;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类人工智能理论 ;
关键词
game theory; implicit cooperation; coordination; teach-ing;

机译：博弈论隐性合作;协调;教学;

相似文献

外文文献
中文文献
专利

1. On the equivalence of optimality criterion and sequential approximate optimization methods in the classical topology layout problem [J] . Groenwold AA, Etman LFP International Journal for Numerical Methods in Engineering . 2008 ,第3期

机译：经典拓扑布局问题中最优准则的等价性和顺序近似优化方法
2. The effects of target discriminability and criterion placement on accuracy rates in sequential and simultaneous target-present lineups [J] . Heather Flowea* amp, Anneka Bessemerb Psychology, Crime & Law . 2011 ,第7期

机译：目标可分辨性和标准位置对顺序和同时出现的目标存在阵容中准确率的影响
3. Explaining cooperation in the finitely repeated simultaneous and sequential prisoner's dilemma game under incomplete and complete information [J] . Dijkstra Jacob, van Assen Marcel A. L. M. The Journal of Mathematical Sociology . 2017 ,第1a4期

机译：在不完整和完整信息下解释有限重复同时和顺序囚犯困境游戏的合作
4. Sequential targeted optimality as a new criterion for teaching and following in repeated games [C] . Max Knobbout, Gerard A. W. Vreeswijk International Joint Conference on Autonomous Agents and Multiagent Systems . 2011

机译：顺序目标最优值作为一种教学的新标准，并在重复游戏中进行
5. The mechanism design approach to optimality in repeated games with private information. [D] . Miller, David Aaron. 2004

机译：具有私人信息的重复游戏中最优性的机制设计方法。
6. eMedOffice: A web-based collaborative serious game for teaching optimal design of a medical practice [O] . Andreas Hannig, Nicole Kuth, Monika Özman, 2012

机译：eMedOffice：基于网络的协作式严肃游戏用于教授医疗实践的最佳设计
7. The effects of target discriminability and criterion placement on accuracy rates in sequential and simultaneous target-present lineups [O] . Flowe, H, Bessemer, A 2011

机译：目标可分辨性和标准位置对顺序和同时出现的目标存在阵容中准确率的影响
8. An activation criterion for repeated use of an optimal fixed time constant energy regulator [R] . Rempfer, P. S. 1965

机译：重复使用最佳固定时间常数能量调节器的激活标准

Sequential targeted optimality as a new criterion for teaching and following in repeated games

摘要

著录项

相似文献

相关主题

期刊订阅