Conference: International Conference on Electrical Engineering, Computing Science and Automatic Control

Computing the Lp-strong Nash equilibrium looking for cooperative stability in multiple agents Markov games

Abstract

The notion of collaboration implies that related agents interact with each other in search of cooperative stability. This notion allows agents to select optimal strategies and to condition their own behavior on the behavior of others in a strategic, forward-looking manner. In game theory, collective stability is a special case of the Nash equilibrium called the strong Nash equilibrium. In this paper we present a novel method for computing the Strong Lp-Nash equilibrium in the case of a metric state space for a class of discrete-time ergodic controllable Markov chain games. We first present a general solution in the Lp-norm for computing the Strong Lp-Nash equilibrium, and then suggest an explicit solution involving the L1 and L2 norms. To solve the problem we use the extraproximal method. We employ Tikhonov's regularization method to ensure the convergence of the cost functions to a unique equilibrium point. The method converges in exponential time to a unique Strong Lp-Nash equilibrium. A game-theoretic example illustrates the main results.
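To make the combination of a two-step (prediction and correction) scheme with Tikhonov regularization more concrete, here is a minimal Python sketch. It is not the paper's algorithm: it replaces the Markov chain game and the Lp-norm formulation with a toy two-player zero-sum matrix game, and it uses a Korpelevich-style extragradient step as a simplified stand-in for the extraproximal method. The function names, the step size `step`, the regularization weight `delta`, and the payoff matrix are all assumptions chosen for illustration.

```python
def project_simplex(v):
    """Euclidean projection of v onto the probability simplex."""
    u = sorted(v, reverse=True)
    cumulative = 0.0
    theta = 0.0
    for k, u_k in enumerate(u, start=1):
        cumulative += u_k
        t = (cumulative - 1.0) / k
        if u_k - t > 0:
            theta = t  # the last k that passes the test gives the threshold
    return [max(x - theta, 0.0) for x in v]


def mat_vec(A, y):
    """Compute A @ y for a list-of-lists matrix."""
    return [sum(a * b for a, b in zip(row, y)) for row in A]


def mat_t_vec(A, x):
    """Compute A.T @ x for a list-of-lists matrix."""
    return [sum(A[i][j] * x[i] for i in range(len(A))) for j in range(len(A[0]))]


def extragradient_tikhonov(A, x, y, step=0.1, delta=0.1, iters=2000):
    """Toy solver (hypothetical) for the regularized saddle problem
        min_x max_y  x^T A y + (delta/2)||x||^2 - (delta/2)||y||^2
    over mixed strategies x, y. The delta terms are the Tikhonov
    regularizer; they make the saddle point unique."""

    def move(x0, y0, xg, yg):
        # Gradients are evaluated at (xg, yg); the step is taken from (x0, y0).
        gx = [g + delta * v for g, v in zip(mat_vec(A, yg), xg)]
        gy = [g - delta * v for g, v in zip(mat_t_vec(A, xg), yg)]
        x1 = project_simplex([v - step * g for v, g in zip(x0, gx)])  # descent in x
        y1 = project_simplex([v + step * g for v, g in zip(y0, gy)])  # ascent in y
        return x1, y1

    for _ in range(iters):
        x_half, y_half = move(x, y, x, y)      # prediction step
        x, y = move(x, y, x_half, y_half)      # correction step
    return x, y


# Matching pennies (assumed example): the row player minimizes x^T A y.
A = [[1.0, -1.0], [-1.0, 1.0]]
x_star, y_star = extragradient_tikhonov(A, [0.9, 0.1], [0.2, 0.8])
print(x_star, y_star)
```

For this payoff matrix both strategies should approach the uniform distribution, which is the equilibrium of the regularized game as well as of the original one. The step size is kept below the reciprocal of the operator's Lipschitz constant (here at most 2 + `delta`), which is the usual condition for extragradient-type schemes to converge.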