Computing the Lp-strong Nash equilibrium looking for cooperative stability in multiple agents Markov games



Abstract

The notion of collaboration implies that related agents interact with each other in search of cooperative stability. This notion allows agents to select optimal strategies and to condition their own behavior on the behavior of others in a strategic, forward-looking manner. In game theory, collective stability is a special case of the Nash equilibrium called the strong Nash equilibrium. In this paper we present a novel method for computing the strong Lp-Nash equilibrium in the case of a metric state space for a class of discrete-time, ergodic, controllable Markov chain games. We first present a general solution for the Lp-norm for computing the strong Lp-Nash equilibrium, and then we suggest an explicit solution involving the L1 and L2 norms. To solve the problem we use the extraproximal method. We employ Tikhonov's regularization method to ensure the convergence of the cost functions to a unique equilibrium point. The method converges in exponential time to a unique strong Lp-Nash equilibrium. A game theory example illustrates the main results.
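The two-step (prediction, then correction) idea behind extraproximal-type methods, combined with a Tikhonov term, can be sketched on a toy problem. The example below is not the paper's algorithm: it assumes a two-player zero-sum matrix game on probability simplices instead of a Markov chain game, uses a projected-gradient (extragradient) form of the two-step iteration, and keeps the regularization weight `delta` fixed. All function names, parameters, and the payoff matrix are hypothetical choices made for illustration.

```python
def project_simplex(v):
    """Euclidean projection of v onto the probability simplex."""
    u = sorted(v, reverse=True)
    css, theta = 0.0, 0.0
    for i, ui in enumerate(u, start=1):
        css += ui
        t = (css - 1.0) / i
        if ui - t > 0:          # condition holds on a prefix; keep the last t
            theta = t
    return [max(vi - theta, 0.0) for vi in v]


def matvec(M, v):
    return [sum(m * vj for m, vj in zip(row, v)) for row in M]


def solve_regularized_game(A, x0, y0, gamma=0.1, delta=0.01, iters=5000):
    """Two-step iteration for min_x max_y  x'Ay + (delta/2)(|x|^2 - |y|^2).

    Assumption: delta is held fixed; the Tikhonov term makes the
    regularized saddle point unique.
    """
    At = [list(col) for col in zip(*A)]

    def grad_x(x, y):  # gradient of the regularized objective in x
        return [a + delta * xi for a, xi in zip(matvec(A, y), x)]

    def grad_y(x, y):  # gradient of the regularized objective in y
        return [a - delta * yi for a, yi in zip(matvec(At, x), y)]

    def move(z, g, sign):
        return project_simplex([zi + sign * gamma * gi for zi, gi in zip(z, g)])

    x, y = list(x0), list(y0)
    for _ in range(iters):
        # Prediction step: gradients evaluated at the current point.
        xh = move(x, grad_x(x, y), -1.0)
        yh = move(y, grad_y(x, y), +1.0)
        # Correction step: start again from (x, y), but use the
        # gradients evaluated at the predicted point (xh, yh).
        x, y = move(x, grad_x(xh, yh), -1.0), move(y, grad_y(xh, yh), +1.0)
    return x, y


# Matching pennies: the minimizing player x pays x'Ay to the maximizer y.
A = [[1.0, -1.0], [-1.0, 1.0]]
x, y = solve_regularized_game(A, x0=[0.9, 0.1], y0=[0.2, 0.8])
print(x, y)
```

For this symmetric toy game both strategies should approach the uniform mixture. The step size `gamma` is chosen below the reciprocal of the Lipschitz constant of the gradient field, which is the usual condition for this kind of two-step scheme.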


Bibliographic record

Source
International Conference on Electrical Engineering, Computing Science and Automatic Control | 2015 | pp. 1-6 | 6 pages
Authors
Trejo Kristal K.; Clempner Julio B.; Poznyak Alexander S.
Original format: PDF
Keywords
Chlorine; Cities and towns; Games; Markov processes; Nash equilibrium; Pareto optimization


Similar literature


1. Computing the strong L_p-Nash equilibrium for Markov chains games: Convergence and uniqueness [J]. Kristal K. Trejo, Julio B. Clempner, Alexander S. Poznyak. Applied Mathematical Modelling, 2017.
2. Computing the strong Nash equilibrium for Markov chains games [J]. Julio B. Clempner, Alexander S. Poznyak. Applied Mathematics and Computation, 2015.
3. A novel method to compute Nash equilibrium in non-cooperative n-person games based on differential evolutionary algorithm [J]. Changbing Li, Huiying Cao, Maokang Du. Intelligent Decision Technologies, 2014, No. 3.
4. Computing the Lp-strong Nash equilibrium looking for cooperative stability in multiple agents Markov games [C]. Kristal K. Trejo, Julio B. Clempner, Alexander S. Poznyak. International Conference on Electrical Engineering, Computing Science and Automatic Control, 2015.
5. Decentralized algorithms for Nash equilibrium problems: applications to multi-agent network interdiction games and beyond [D]. Harikrishnan Sreekumaran. 2015.
6. From rationality to cooperativeness: The totally mixed Nash equilibrium in Markov strategies in the iterated Prisoner’s Dilemma [O]. Ivan S. Menshikov, Alexsandr V. Shklover, Tatiana S. Babkina. 2011.
7. An Enhanced Model-Free Reinforcement Learning Algorithm to Solve Nash Equilibrium for Multi-Agent Cooperative Game Systems [O]. Yuannan Jiang, Fuxiao Tan. 2020.
8. New Strategy-Adjustment Process for Computing a Nash Equilibrium in a Noncooperative More-Person Game [R]. A. van den Elzen, D. Talman. 1986.
