Self-Adapting Payoff Matrices in Repeated Interactions

机译：重复交互中的自适应支付矩阵

获取原文

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Traditional iterated prisoner's dilemma (IPD) assumed a fixed payoff matrix for all players, which may not be realistic because not all players are the same in the real-world. This paper introduces a novel co-evolutionary framework where each strategy has its own self-adaptive payoff matrix. This framework is generic to any simultaneous two-player repeated encounter game. Here, each strategy has a set of behavioral responses based on previous moves, and an adaptable payoff matrix based on reinforcement feedback from game interactions that is specified by update rules. We study how different update rules affect the adaptation of initially random payoff matrices, and how this adaptation in turn affects the learning of strategy behaviors.

机译：传统迭代囚犯的困境（IPD）为所有球员承担了固定的支付矩阵，这可能不是现实的，因为并非所有球员都在现实世界中也是如此。本文介绍了一种新的共同进化框架，每个策略都有自己的自适应支付矩阵。此框架是通用的任何同时双人反复遇到的游戏。这里，每个策略具有基于先前移动的一组行为响应，以及基于由更新规则指定的游戏交互的增强反馈的适应性收益矩阵。我们研究不同的更新规则如何影响最初随机的回报矩阵的适应，以及这种适应如何反过来影响战略行为的学习。

著录项

来源
《IEEE Symposium on Computational Intelligence and Games》|2006年||共8页
会议地点
作者
Siang Y. Chong; Xin Yao;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP18-53;
关键词
Evolutionary games; Co-evolution; Iterated Prisoner's Dilemma; Mutualism; Repeated Encounter Games;

机译：进化游戏;共同进化;迭代囚犯的困境;共同主义;重复遭遇游戏;

相似文献

外文文献
中文文献
专利

1. When is the lowest equilibrium payoff in a repeated game equal to the min max payoff? [J] . Olivier Gossner, Johannes Hoerner Journal of economic theory . 2010,第1期

机译：重复游戏中的最低均衡收益何时等于最小最大收益？
2. Approximation of Isomorphic Infinite Two-Person Non-Cooperative Games by Variously Sampling the Players’ Payoff Functions and Reshaping Payoff Matrices into Bimatrix Game [J] . Vadim V. Romanuke, Vladimir V. Kamburg Applied Computer Systems . 2016,第1期

机译：通过对玩家的支付功能进行各种采样并将支付矩阵重塑为Bimatrix游戏来逼近同构无限两人非合作游戏
3. Infinite products of random matrices and repeated interaction dynamics [J] . Laurent Bruneau, Alain Joye, Marco Merkli Annales de L'institut Henri Poincare . 2010,第2期

机译：随机矩阵和重复相互作用动力学的无限乘积
4. Self-Adapting Payoff Matrices in Repeated Interactions [C] . Siang Y. Chong, Xin Yao IEEE Symposium on Computational Intelligence and Games . 2006

机译：重复交互中的自适应支付矩阵
5. Towards Cooperating in Repeated Interactions Without Repeating Structure [D] . Pham, Huy. 2020

机译：在没有重复结构的情况下在重复的相互作用中进行合作
6. The evolution of payoff matrices: providing incentives to cooperate [O] . Erol Akçay, Joan Roughgarden 2011

机译：回报矩阵的演变：提供合作激励
7. Self-Adapting Payoff Matrices in Repeated Interactions [O] . Siang Y. Chong, Xin Yao 2008

机译：重复交互中的自适应支付矩阵

Self-Adapting Payoff Matrices in Repeated Interactions

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅