首页> 外文期刊>Applied mathematics and computation >Computing the strong Nash equilibrium for Markov chains games
【24h】

Computing the strong Nash equilibrium for Markov chains games

机译:计算马尔可夫链博弈的强纳什均衡

获取原文
获取原文并翻译 | 示例
           

摘要

In this paper, we present a novel method for finding the strong Nash equilibrium. The approach consists on determining a scalar lambda* and the corresponding strategies d*(lambda*) fixing specific bounds (min and max) that belong to the Pareto front. Bounds correspond to restrictions imposed by the player over the Pareto front that establish a specific decision area where the strategies can be selected. We first exemplify the Pareto front of the game in terms of a nonlinear programming problem adding a set of linear constraints for the Markov chain game based on the c-variable method. For solving the strong Nash equilibrium problem we propose to employ the Euler method and a penalty function with regularization. The Tikhonov's regularization method is used to guarantee the convergence to a single (strong) equilibrium point. Then, we established a nonlinear programming method to solve the successive single-objective constrained problems that arise from taking the regularized functional of the game. To achieve the goal, we implement the gradient method to solve the first-order optimality conditions. Starting from an utopia point (Pareto optimal point) given an initial lambda of the individual objectives the method solves an optimization problem adding linear constraints required to find the optimal strong strategy d*(lambda*). We show that in the regularized problem the functional of the game decrease and finally converges, proving the existence and uniqueness of strong Nash equilibrium (Pareto-optimal Nash equilibrium). In addition, we present the convergence conditions and compute the estimated rate of convergence of variables gamma and delta corresponding to the step size parameter of the gradient method and the Tikhonov's regularization respectively. Moreover, we provide all the details needed to implement the method in an efficient and numerically stable way. The usefulness of the method is successfully demonstrated by a numerical example. (C) 2015 Elsevier Inc. All rights reserved.
机译:在本文中,我们提出了一种寻找强纳什平衡的新方法。该方法包括确定标量lambda *和相应的策略d *(lambda *)固定属于Pareto前沿的特定范围(最小和最大)。界限对应于玩家在帕累托前面施加的限制,这些限制建立了可以选择策略的特定决策区域。我们首先以非线性规划问题为例,以基于c变量方法为Markov链博弈添加一组线性约束的非线性规划问题为例来说明博弈的Pareto前沿。为了解决强纳什均衡问题,我们建议采用欧拉方法和带正则化的罚函数。 Tikhonov的正则化方法用于保证收敛到单个(强)平衡点。然后,我们建立了一种非线性规划方法,以解决由于采用正则化游戏功能而引起的连续单目标约束问题。为了达到这个目的,我们采用梯度法求解一阶最优条件。从给定单个目标的初始拉姆达的乌托邦点(帕累托最优点)开始,该方法解决了一个优化问题,添加了找到最优强策略d *(lambda *)所需的线性约束。我们证明,在正则化问题中,博弈的功能降低并最终收敛,证明了强纳什均衡(帕累托最优纳什均衡)的存在和唯一性。此外,我们给出了收敛条件,并分别计算了与梯度法的步长参数和Tikhonov正则化相对应的变量gamma和delta的收敛速度。此外,我们提供了以有效且数值稳定的方式实施该方法所需的所有细节。数值例子成功地证明了该方法的有效性。 (C)2015 Elsevier Inc.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号