首页> 外文OA文献 >Introducing decision entrustment mechanism into repeated bilateral agent interactions to achieve social optimality
【2h】

Introducing decision entrustment mechanism into repeated bilateral agent interactions to achieve social optimality

机译:将决策委托机制介绍反复双侧剂相互作用,以实现社会优选

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

During multiagent interactions, robust strategies are needed to help the agents to coordinate their actions on efficient outcomes. A large body of previous work focuses on designing strategies towards the goal of Nash equilibrium under self-play, which can be extremely inefficient in many situations such as prisoner’s dilemma game. To this end, we propose an alternative solution concept, socially optimal outcome sustained by Nash equilibrium (SOSNE), which refers to those outcomes that maximize the sum of all agents’ payoffs among all the possible outcomes that can correspond to a Nash equilibrium payoff profile in the infinitely repeated games. Adopting the solution concept of SOSNE guarantees that the system-level performance can be maximized provided that no agent will sacrifice its individual profits. On the other hand, apart from performing well under self-play, a good strategy should also be able to well respond against those opponents adopting different strategies as much as possible. To this end, we consider a particular class of rational opponents and we target at influencing those opponents to coordinate on SOSNE outcomes. We propose a novel learning strategy TaFSO which combines the characteristics of both teacher and follower strategies to effectively influence the opponent’s behavior towards SOSNE outcomes by exploiting their limitations. Extensive simulations show that our strategy TaFSO achieves better performance in terms of average payoffs obtained than previous work under both self-play and against the same class of rational opponents.
机译:在多读性互动期间,需要强大的策略来帮助代理人协调其对有效结果的行为。以前的工作中的大量工作侧重于自我播放下纳什均衡目标的设计策略,这在囚犯的困境游戏等许多情况下可能是极低效率的。为此,我们提出了替代解决方案概念,纳什均衡(Sosne)的社会最佳结果,这是指最大化所有代理商在纳什均衡支付概况的可能结果中最大化所有代理商的回报的结果在无限重复的游戏中。采用Sosne的解决方案概念保证系统级性能可以最大化,但没有代理人将牺牲其个性利润。另一方面,除了在自我发挥下表现良好,良好的策略也应该能够充分应对采用不同策略的对手尽可能多。为此,我们考虑了一类特定的理性对手,我们瞄准影响那些对手协调Sosne成果。我们提出了一种小说学习策略TAFSO,它将教师和追随者策略的特征结合在一起,通过利用其局限性来有效地影响对手对SOSNE成果的行为。广泛的模拟表明,我们的战略TAFSO在比以前的平均收益方面取得了更好的表现,而不是在自我扮演中获得的平均收益,并反对同一类理性对手。

著录项

  • 作者

    Jianye Hao; Ho-fung Leung;

  • 作者单位
  • 年度 2014
  • 总页数
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号