首页> 外文会议>International conference on artificial life >Cooperation in an Unpredictable Environment
【24h】

Cooperation in an Unpredictable Environment

机译:在不可预测的环境中合作

获取原文

摘要

A framework for studying the evolution of cooperative behaviour in a random environment, using evolution of finite state strategies, is presented. The interaction between agents is modelled by a repeated game with random observable payoffs. The agents are thus faced with a more complex situation, compared to the Prisoner's Dilemma that has been widely used for investigating the conditions for cooperation in evolving populations (Matsuo 1985; Axelrod 1987; Miller 1989; Lindgren 1992; Ikegami 1994; Lindgren & Nordahl 1994; Lindgren 1997). Still, there are robust cooperating strategies that usually evolve in a population of agents. In the cooperative mode, these strategies selects an action that allows for maximizing the payoff sum of both players in each round, regardless of the own payoff. Two such players maximize the expected total long-term payoff. If the opponent deviates from this scheme, the strategy invokes a punishment action, which aims to lower the opponent's score for the rest of the (possibly infinitely) repeated game. The introduction of mistakes to the game actually pushes evolution towards more cooperative strategies even though the game becomes more difficult.
机译:提出了一种利用有限状态策略的演变研究随机环境中合作行为演变的框架。代理之间的相互作用是由重复游戏模拟的,具有随机可观察的收益。因此,与囚犯的困境相比,该代理人面临着更复杂的情况,这些情况已被广泛用于调查在不断发展的人群中的合作条件(Matsuo 1985; Axelrod 1987; Lindgren 1992; Ikegami 1994; Lindgren&Nordahl 1994; Lindgren&Nordahl 1994 ; LINDGREN 1997)。尽管如此,还有强大的合作策略,通常在代理人群体中发展。在合作模式中,这些策略选择了一种动作,允许在每轮中最大化两个玩家的支付总和,无论自己的回报如何。两个这样的玩家最大化预期的总长期收益。如果对手偏离了这一计划,该战略援引了一个惩罚行动,旨在降低对手的其他(可能无限)重复比赛的比分。对于游戏的错误引入实际上推动了更加合作策略的演变,即使游戏变得更加困难。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号