...
首页> 外文期刊>IIE Transactions >A Reinforcement Learning Algorithm For Obtaining The Nash Equilibrium Of Multi-player Matrix Games
【24h】

A Reinforcement Learning Algorithm For Obtaining The Nash Equilibrium Of Multi-player Matrix Games

机译:一种获得多人矩阵游戏纳什均衡的强化学习算法

获取原文
获取原文并翻译 | 示例
           

摘要

With the advent of e-commerce, the contemporary marketplace has evolved significantly toward competition-based trading of goods and services. Competition in many such market scenarios can be modeled as matrix games. This paper presents a computational algorithm to obtain the Nash equilibrium of n-player matrix games. The algorithm uses a stochastic-approximation-based Reinforcement Learning (RL) approach and has the potential to solve n-player matrix games with large player-action spaces. The proposed RL algorithm uses a value-iteration-based approach, which is well established in the Markov decision processes/semi-Markov decision processes literature. To emphasize the broader impact of our solution approach for matrix games, we discuss the established connection of matrix games with discounted and average reward stochastic games, which model a much larger class of problems. The solutions from the RL algorithm are extensively benchmarked with those obtained from an openly available software (GAMBIT). This comparative analysis is performed on a set of 16 matrix games with up to four players and 64 action choices. We also implement our algorithm on practical examples of matrix games that arise due to strategic bidding in restructured electric power markets.
机译:随着电子商务的出现,当代市场已朝着基于竞争的商品和服务贸易发展。在许多这样的市场情景中,竞争可以建模为矩阵博弈。本文提出了一种计算算法来获得n玩家矩阵博弈的纳什均衡。该算法使用基于随机逼近的强化学习(RL)方法,具有解决具有较大玩家动作空间的n玩家矩阵游戏的潜力。提出的RL算法使用基于值迭代的方法,该方法在Markov决策过程/半Markov决策过程文献中得到了很好的确立。为了强调我们的解决方案方法对矩阵游戏的广泛影响,我们讨论了矩阵游戏与打折和平均奖励随机游戏之间的已建立联系,该模型模拟了更多类型的问题。 RL算法的解决方案使用从公开软件(GAMBIT)获得的解决方案进行了广泛的基准测试。这项比较分析是在一组16个矩阵游戏中进行的,最多可有4个玩家和64个动作选择。我们还对由于重组电力市场中的战略竞标而产生的矩阵博弈的实际示例实施了我们的算法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号