首页> 外文期刊>Physical review, E. Statistical physics, plasmas, fluids, and related interdisciplinary topics >Self-organization in a simple model of adaptive agents playing 2x2 games with arbitrary payoff matrices
【24h】

Self-organization in a simple model of adaptive agents playing 2x2 games with arbitrary payoff matrices

机译:在具有任意支付矩阵的2x2游戏的自适应代理的简单模型中的自组织

获取原文
获取原文并翻译 | 示例
           

摘要

We analyze, both analytically and numerically, the self-organization of a system of "selfish" adaptive agents playing an arbitrary iterated pairwise game (defined by a 2x2 payoff matrix). Examples of possible games to play are the prisoner's dilemma (PD) game, the chicken game, the hero game, etc. The agents have no memory, use strategies not based on direct reciprocity nor "tags" and are chosen at random, i.e., geographical vicinity is neglected. They can play two possible strategies: cooperate (C) or defect (D). The players measure their success by comparing their utilities with an estimate for the expected benefits and update their strategy following a simple rule. Two versions of the model are studied: (1) the deterministic version (the agents are either in definite states C or D) and (2) the stochastic version (the agents have a probability c of playing C). Using a general master equation we compute the equilibrium states into which the system self-organizes, characterized by their average probability of cooperation c(eq). Depending on the payoff matrix, we show that c(eq) can take five different values. We also consider the mixing of agents using two different payoff matrices and show that any value of c(eq) can be reached by tuning the proportions of agents using each payoff matrix. In particular, this can be used as a way to simulate the effect of a fraction d of "antisocial" individuals-incapable of realizing any value to cooperation-on the cooperative regime hold by a population of neutral or "normal" agents.
机译:我们在分析和数值上都分析了一个“自私的”自适应主体系统的自组织,该系统扮演着任意迭代的成对博弈(由2x2收益矩阵定义)。可能玩的游戏的例子是囚徒困境(PD)游戏,鸡游戏,英雄游戏等。代理人没有记忆力,使用的策略不是基于直接互惠也不是“标签”,而是随机选择的,即,地理位置被忽略。他们可以采取两种可能的策略:合作(C)或缺陷(D)。参与者通过将其效用与预期收益的估计值进行比较来衡量其成功,并遵循简单的规则来更新其策略。研究了模型的两个版本:(1)确定性版本(代理处于确定状态C或D)和(2)随机版本(代理具有玩C的概率c)。使用通用主方程,我们计算出系统自组织成的平衡状态,其平均合作概率为c(eq)。根据收益矩阵,我们表明c(eq)可以采用五个不同的值。我们还考虑了使用两个不同的回报矩阵对代理商进行混合,并表明可以通过使用每个回报矩阵调整代理商的比例来获得任何c(eq)值。特别地,这可以用作模拟一部分“反种族的”个体(无法实现任何合作价值)对中性或“正常”行为者群体所持有的合作制度的影响的方式。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号