首页> 外文会议>American Control Conference >Stochastic stability analysis of perturbed learning automata with constant step-size in strategic-form games
【24h】

Stochastic stability analysis of perturbed learning automata with constant step-size in strategic-form games

机译:策略形式博弈中具有恒定步长的扰动学习自动机的随机稳定性分析

获取原文

摘要

This paper considers a class of reinforcement-learning that belongs to the family of Learning Automata and provides a stochastic-stability analysis in strategic-form games. For this class of dynamics, convergence to pure Nash equilibria has been demonstrated only for the fine class of potential games. Prior work primarily provides convergence properties of the dynamics through stochastic approximations, where the asymptotic behavior can be associated with the limit points of an ordinary-differential equation (ODE). However, analyzing global convergence through the ODE-approximation requires the existence of a Lyapunov or a potential function, which naturally restricts the applicability of these algorithms to a fine class of games. To overcome these limitations, this paper introduces an alternative framework for analyzing stochastic-stability that is based upon an explicit characterization of the (unique) invariant probability measure of the induced Markov chain.
机译:本文考虑了属于学习自动机家族的一类强化学习,并提供了战略形式游戏中的随机稳定性分析。对于此类动力学,仅对于优秀的潜在游戏类别,已证明可以收敛到纯纳什均衡。先验工作主要通过随机逼近提供动力学的收敛特性,其中渐近行为可以与常微分方程(ODE)的极限点相关联。但是,通过ODE逼近分析全局收敛性需要存在Lyapunov或潜在函数,这自然将这些算法的适用性限制在良好的游戏类别中。为了克服这些限制,本文介绍了一种替代框架,用于分析随机稳定性,该框架基于对诱导马尔可夫链的(唯一)不变概率度量的显式表征。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号