首页> 外文会议>IEEE Annual Conference on Decision and Control >Linear-Quadratic Stochastic Differential Stackelberg Games with a High Population of Followers
【24h】

Linear-Quadratic Stochastic Differential Stackelberg Games with a High Population of Followers

机译:线性二次随机差动Stackelberg游戏,具有高粉丝的粉丝

获取原文

摘要

We consider a class of stochastic differential games with the Stackelberg mode of play, with one leader and N uniform followers (where N is sufficiently large), where each player has its own local controlled dynamics and quadratic cost function, with the coupling between the players being through the cost functions. Particularly, the leader's cost function has as input the average value of the states of the followers, and each follower's cost function has a similar term in addition to being directly affected by the control function of the leader; thus, the leader controls the behavior of the followers (who play a Nash game) through his control strategy. As such, this class of stochastic differential games is quite difficult to analyze and obtain the Stackelberg-Nash solution of. To circumvent this difficulty, our approach in this paper is to imbed the original game in a class of mean-field stochastic differential games, where the followers solve individual stochastic control problems given the mean field behavior of their average states and with leader's control taken as an exogenous stochastic process. We show that for each fixed policy of the leader, the followers' optimal decentralized local policies lead to an ∈-Nash equilibrium, where ∈ = O(1/{the square root of}N). The paper then solves the leader's optimal control problem, as a constrained optimization problem, with the constraint being induced by the ∈-Nash equilibrium policies of the followers (which depend on the leaders control as an exogenous process). We obtain the leaders optimal decentralized local control, which we subsequently show to constitute an O(1/{the square root of}N)-approximate Stackelberg equilibrium for the original game. A numerical example included in the paper illustrates the theoretical results.
机译:我们认为,一类与播放施塔贝格模式随机微分方程游戏,一个领导和N均匀的追随者(其中N足够大),其中每个玩家都有自己的本地控制,动态和二次成本函数,与玩家之间的耦合可通过所述成本函数。尤其,领导者的成本函数具有作为输入的追随者的状态的平均值,而每个跟随者的成本函数具有除了被直接受前导的控制功能的类似术语;因此,领导者控制的追随者通过他的控制策略(谁玩游戏纳什)的行为。因此,本类随机微分博弈是相当困难的分析和获得的斯坦博格 - 纳什的解决方案。为了克服这个困难,我们在本文的方法是嵌入原来的游戏一类的平均场随机微分游戏,那里的追随者解决给定的平均状态的平均场行为,并与领导者的控制作为单独的随机控制问题外源的随机过程。我们表明,为领导的每个固定的政策,追随者的最优分散本地策略导致∈-Nash均衡,其中∈= O(1 / {的平方根} N)。本文则解决了领导者的最优控制问题,作为一个约束优化问题,用约束而引起的追随者(这取决于领导人控制作为外生过程)的∈ - 纳什均衡策略。我们获得领袖最优分散局部控制,这是我们随后展示给构成的O -approximate Stackelberg均衡(N {的平方根} 1 /)为原来的游戏。包括在纸张的数值示例示出了理论结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号