首页> 外文期刊>Autonomous agents and multi-agent systems >Introducing decision entrustment mechanism into repeated bilateral agent interactions to achieve social optimality
【24h】

Introducing decision entrustment mechanism into repeated bilateral agent interactions to achieve social optimality

机译:在反复的双边代理商互动中引入决策委托机制,以实现社会最优

获取原文
获取原文并翻译 | 示例
           

摘要

During multiagent interactions, robust strategies are needed to help the agents to coordinate their actions on efficient outcomes. A large body of previous work focuses on designing strategies towards the goal of Nash equilibrium under self-play, which can be extremely inefficient in many situations such as prisoner's dilemma game. To this end, we propose an alternative solution concept, socially optimal outcome sustained by Nash equilibrium (SOSNE), which refers to those outcomes that maximize the sum of all agents' payoffs among all the possible outcomes that can correspond to a Nash equilibrium payoff profile in the infinitely repeated games. Adopting the solution concept of SOSNE guarantees that the system-level performance can be maximized provided that no agent will sacrifice its individual profits. On the other hand, apart from performing well under self-play, a good strategy should also be able to well respond against those opponents adopting different strategies as much as possible. To this end, we consider a particular class of rational opponents and we target at influencing those opponents to coordinate on SOSNE outcomes. We propose a novel learning strategy TaFSO which combines the characteristics of both teacher and follower strategies to effectively influence the opponent's behavior towards SOSNE outcomes by exploiting their limitations. Extensive simulations show that our strategy TaFSO achieves better performance in terms of average payoffs obtained than previous work under both self-play and against the same class of rational opponents.
机译:在多主体交互过程中,需要强有力的策略来帮助主体协调其针对有效结果的行动。先前的大量工作着重于设计在自我游戏下实现纳什均衡目标的策略,这在许多情况下(例如囚徒困境游戏)效率极低。为此,我们提出了一个替代的解决方案概念,即纳什均衡(SOSNE)维持的社会最优结果,指的是那些在所有可能的结果中都能够最大化对应于纳什均衡收益概况的所有代理人收益之和的那些结果。在无限重复的游戏中。采用SOSNE的解决方案概念可以确保在没有代理商牺牲其个人利润的情况下,可以最大化系统级性能。另一方面,除了在自我游戏中表现出色外,好的策略还应该能够很好地应对那些采用不同策略的对手。为此,我们考虑一类特殊的理性反对者,我们的目标是影响那些反对者以协调SOSNE结果。我们提出了一种新颖的学习策略TaFSO,该策略结合了教师策略和追随者策略的特征,以通过利用对手的局限性来有效地影响对手对SOSNE结果的行为。大量的模拟表明,在自打和同类型理性对手的竞争下,我们的策略TaFSO在获得的平均收益方面比以前的工作表现更好。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号