首页> 外文会议>International conference on autonomous agents and multiagent systems;AAMAS 2011 >Sequential targeted optimality as a new criterion for teaching and following in repeated games
【24h】

Sequential targeted optimality as a new criterion for teaching and following in repeated games

机译:顺序目标最优作为重复游戏中教学和跟随的新标准

获取原文

摘要

In infinitely repeated games, the act of teaching an outcome to our adversaries can be beneficial to reach coordination, as well as allowing us to 'steer' adversaries to outcomes that are more beneficial to us. Teaching works well against followers, agents that are willing to go along with the proposal, but can lead to miscoordination otherwise. In the context of infinitely repeated games there is, as of yet, no clear formalism that tries to capture and combine these behaviours into a unified view in order to reach a solution of a game. In this paper, we propose such a formalism in the form of an algorithmic criterion, which uses the concept of targeted learning. As we will argue, this criterion can be a beneficial criterion to adopt in order to reach coordination. Afterwards we propose an algorithm that adheres to our criterion that is able to teach pure strategy Nash Equilibria to a broad class of opponents in a broad class of games and is able to follow otherwise, as well as able to perform well in self-play.
机译:在无限次重复的游戏中,向我们的对手传授结果的行为可能有益于达成协调,并允许我们将对手“引导”到对我们更有利的结果上。教学对于愿意跟进该建议的追随者,代理商非常有效,但否则会导致协调不善。迄今为止,在无限重复的游戏中,还没有明确的形式主义试图将这些行为捕捉并结合到一个统一的视图中,以寻求游戏的解决方案。在本文中,我们以算法准则的形式提出了这种形式主义,它使用了目标学习的概念。就像我们将要争论的那样,该标准可能是为了达成协调而采用的有益标准。之后,我们提出了一种符合我们标准的算法,该算法能够向各种游戏中的众多对手教授纯净策略纳什均衡,并且能够遵循其他准则,并且能够在自打中表现出色。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号