...
首页> 外文期刊>Web Intelligence and Agent Systems >Reinforcement social learning of social optimality with influencer agents
【24h】

Reinforcement social learning of social optimality with influencer agents

机译:通过影响者代理人加强社会对社会最优性的学习

获取原文
获取原文并翻译 | 示例

摘要

In many multiagent systems (MAS), it is desirable that the agents can coordinate with one another on achieving socially optimal outcomes to increase the system level performance, and the traditional way of attaining this goal is to endow the agents with social rationality [in: Proc. of AAAI Fall Symposium on Socially Intelligent Agents, 1997, pp. 61-63] - agents act as system utility maximizers. However, this is difficult to implement when we are facing open MAS domains such as peer-to-peer network and mobile ad-hoc networks, since we do not have control on all agents' behaviors in such systems and each agent usually behaves individually rationally as an individual utility maximizer only. In this paper, we propose injecting a number of influencer agents [in: Proc. of AAMAS'13, ACM Press, 2013, pp. 447-454, AAMAS (2012)] to manipulate the behaviors of individually rational agents and investigate whether the individually rational agents can eventually be incentivized to coordinate on achieving socially optimal outcomes. We evaluate the effects of influencer agents in two common types of games: prisoner's dilemma games and anti-coordination games. Simulation results show that a small proportion of influencer agents can significantly increase the average percentage of socially optimal outcomes attained in the system and better performance can be achieved compared with that of previous work.
机译:在许多多主体系统(MAS)中,理想的是,主体之间可以相互协调以实现社会最优结果,从而提高系统级的性能,而达到这一目标的传统方式是赋予主体以社会理性[程序AAAI关于社会智能代理的秋季研讨会,1997年,第61-63页]-代理充当系统效用最大化器。但是,当我们面对开放的MAS域(例如对等网络和移动自组织网络)时,这很难实现,因为我们无法控制此类系统中所有代理的行为,并且每个代理通常会理性地行为仅作为单个效用最大化器。在本文中,我们建议注入许多有影响力的代理人。 AAMAS'13,ACM Press,2013,第447-454页,AAMAS(2012)]来操纵个体理性行为者的行为,并研究是否最终可以激励个体理性行为者协调实现社会最优结果。我们在两种常见的游戏类型中评估影响者的影响:囚徒困境游戏和反协调游戏。仿真结果表明,一小部分影响者可以显着提高系统中所获得的社会最优结果的平均百分比,并且与以前的工作相比,可以获得更好的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号