【24h】

Cooperation in an Unpredictable Environment

机译:变幻莫测的环境中的合作

获取原文
获取原文并翻译 | 示例

摘要

A framework for studying the evolution of cooperative behaviour in a random environment, using evolution of finite state strategies, is presented. The interaction between agents is modelled by a repeated game with random observable payoffs. The agents are thus faced with a more complex situation, compared to the Prisoner's Dilemma that has been widely used for investigating the conditions for cooperation in evolving populations (Matsuo 1985; Axelrod 1987; Miller 1989; Lindgren 1992; Ikegami 1994; Lindgren & Nordahl 1994; Lindgren 1997). Still, there are robust cooperating strategies that usually evolve in a population of agents. In the cooperative mode, these strategies selects an action that allows for maximizing the payoff sum of both players in each round, regardless of the own payoff. Two such players maximize the expected total long-term payoff. If the opponent deviates from this scheme, the strategy invokes a punishment action, which aims to lower the opponent's score for the rest of the (possibly infinitely) repeated game. The introduction of mistakes to the game actually pushes evolution towards more cooperative strategies even though the game becomes more difficult.
机译:提出了一种使用有限状态策略研究随机环境中合作行为演变的框架。代理之间的交互是通过具有随机可观察到的收益的重复博弈建模的。因此,与被广泛用于调查不断变化的人口合作条件的《囚徒困境》相比,特工面临着更为复杂的局面(Matsuo 1985; Axelrod 1987; Miller 1989; Lindgren 1992; Ikegami 1994; Lindgren&Nordahl 1994) ; Lindgren 1997)。仍然有一些强大的合作策略,通常会在代理商群体中发展。在合作模式下,这些策略选择一个动作,该动作允许在每个回合中最大化两个玩家的收益总和,而与自己的收益无关。两个这样的参与者使预期的长期总收益最大化。如果对手偏离了该方案,则该策略将调用惩罚动作,该动作旨在降低其余(可能无限地)重复游戏的对手得分。实际上,即使游戏变得更加困难,将错误引入游戏中实际上也会促使进化朝着更加合作的策略发展。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号