Computing the strong Nash equilibrium for Markov chains games

Clempner Julio B.; Poznyak Alexander S.

首页> 外文期刊>Applied mathematics and computation >Computing the strong Nash equilibrium for Markov chains games

【24h】

Computing the strong Nash equilibrium for Markov chains games

机译：计算马尔可夫链博弈的强纳什均衡

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper, we present a novel method for finding the strong Nash equilibrium. The approach consists on determining a scalar lambda* and the corresponding strategies d*(lambda*) fixing specific bounds (min and max) that belong to the Pareto front. Bounds correspond to restrictions imposed by the player over the Pareto front that establish a specific decision area where the strategies can be selected. We first exemplify the Pareto front of the game in terms of a nonlinear programming problem adding a set of linear constraints for the Markov chain game based on the c-variable method. For solving the strong Nash equilibrium problem we propose to employ the Euler method and a penalty function with regularization. The Tikhonov's regularization method is used to guarantee the convergence to a single (strong) equilibrium point. Then, we established a nonlinear programming method to solve the successive single-objective constrained problems that arise from taking the regularized functional of the game. To achieve the goal, we implement the gradient method to solve the first-order optimality conditions. Starting from an utopia point (Pareto optimal point) given an initial lambda of the individual objectives the method solves an optimization problem adding linear constraints required to find the optimal strong strategy d*(lambda*). We show that in the regularized problem the functional of the game decrease and finally converges, proving the existence and uniqueness of strong Nash equilibrium (Pareto-optimal Nash equilibrium). In addition, we present the convergence conditions and compute the estimated rate of convergence of variables gamma and delta corresponding to the step size parameter of the gradient method and the Tikhonov's regularization respectively. Moreover, we provide all the details needed to implement the method in an efficient and numerically stable way. The usefulness of the method is successfully demonstrated by a numerical example. (C) 2015 Elsevier Inc. All rights reserved.

机译：在本文中，我们提出了一种寻找强纳什平衡的新方法。该方法包括确定标量lambda *和相应的策略d *（lambda *）固定属于Pareto前沿的特定范围（最小和最大）。界限对应于玩家在帕累托前面施加的限制，这些限制建立了可以选择策略的特定决策区域。我们首先以非线性规划问题为例，以基于c变量方法为Markov链博弈添加一组线性约束的非线性规划问题为例来说明博弈的Pareto前沿。为了解决强纳什均衡问题，我们建议采用欧拉方法和带正则化的罚函数。 Tikhonov的正则化方法用于保证收敛到单个（强）平衡点。然后，我们建立了一种非线性规划方法，以解决由于采用正则化游戏功能而引起的连续单目标约束问题。为了达到这个目的，我们采用梯度法求解一阶最优条件。从给定单个目标的初始拉姆达的乌托邦点（帕累托最优点）开始，该方法解决了一个优化问题，添加了找到最优强策略d *（lambda *）所需的线性约束。我们证明，在正则化问题中，博弈的功能降低并最终收敛，证明了强纳什均衡（帕累托最优纳什均衡）的存在和唯一性。此外，我们给出了收敛条件，并分别计算了与梯度法的步长参数和Tikhonov正则化相对应的变量gamma和delta的收敛速度。此外，我们提供了以有效且数值稳定的方式实施该方法所需的所有细节。数值例子成功地证明了该方法的有效性。（C）2015 Elsevier Inc.保留所有权利。

著录项

来源
《Applied mathematics and computation》 |2015年第null期|共17页
作者
Clempner Julio B.; Poznyak Alexander S.;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Strong Nash equilibrium; Pareto-optimal Nash equilibrium; Markov chains; Came theory;

机译：强纳什均衡;帕累托最优纳什均衡;马尔可夫链;相似理论;

相似文献

外文文献
中文文献
专利

1. Computing the strong L_p- Nash equilibrium for Markov chains games: Convergence and uniqueness [J] . Kristal K. Trejo, Julio B. Clempner, Alexander S. Poznyak Applied Mathematical Modelling . 2017,第Jana期

机译：计算马尔可夫链博弈的强大L_p- Nash均衡：收敛性和唯一性
2. Computing the Nash Bargaining Solution for Multiple Players in Discrete-Time Markov Chains Games [J] . Kristal K. Trejo, Julio B. Clempner, Alexander S. Poznyak Cybernetics and Systems . 2020,第1a4期

机译：在离散时间马尔可夫连锁店游戏中计算多个玩家的NASH讨价还价解决方案
3. COMPUTING THE STACKELBERG/NASH EQUILIBRIA USING THE EXTRAPROXIMAL METHOD: CONVERGENCE ANALYSIS AND IMPLEMENTATION DETAILS FOR MARKOV CHAINS GAMES [J] . Trejo Kristal K., Clempner Julio B., Poznyak Alexander S. International Journal of Applied Mathematics and Computer Science . 2015,第2期

机译：使用近端方法计算STACKELBERG / NASH平衡：马尔可夫链游戏的收敛性分析和实现细节
4. Computing the Lp-strong nash equilibrium looking for cooperative stability in multiple agents markov games [C] . Trejo Krital K., Clempner Julio B., Poznyak Alexander S. International Conference on Electrical Engineering, Computing Science and Automatic Control . 2015

机译：计算Lp-强纳什均衡以寻找多个Agent Markov游戏中的合作稳定性
5. CHARACTERIZATIONS OF STRONG ERGODICITY FOR CONTINUOUS TIME MARKOV CHAINS. [D] . SCOTT, MARK. 1979

机译：连续时间马尔可夫链的强电性特征。
6. Efficient Nash Equilibrium Resource Allocation Based on Game Theory Mechanism in Cloud Computing by Using Auction [O] . Amin Nezarat, GH Dastghaibifard -1

机译：拍卖中基于博弈论机制的高效纳什均衡资源分配
7. Computing the Stackelberg/Nash equilibria using the extraproximal method: Convergence analysis and implementation details for Markov chains games [O] . Trejo Kristal K., Clempner Julio B., Poznyak Alexander S. 2015

机译：使用extraproximal方法计算stackelberg / Nash均衡：markov链游戏的收敛性分析和实现细节

Computing the strong Nash equilibrium for Markov chains games

摘要

著录项

相似文献

相关主题

期刊订阅