首页> 外文会议>IEEE Conference on Decision and Control >Linear-quadratic stochastic differential Stackelberg games with a high population of followers
【24h】

Linear-quadratic stochastic differential Stackelberg games with a high population of followers

机译:具有大量追随者的线性二次随机微分Stackelberg游戏

获取原文

摘要

We consider a class of stochastic differential games with the Stackelberg mode of play, with one leader and N uniform followers (where N is sufficiently large), where each player has its own local controlled dynamics and quadratic cost function, with the coupling between the players being through the cost functions. Particularly, the leader's cost function has as input the average value of the states of the followers, and each follower's cost function has a similar term in addition to being directly affected by the control function of the leader; thus, the leader controls the behavior of the followers (who play a Nash game) through his control strategy. As such, this class of stochastic differential games is quite difficult to analyze and obtain the Stackelberg-Nash solution of. To circumvent this difficulty, our approach in this paper is to imbed the original game in a class of mean-field stochastic differential games, where the followers solve individual stochastic control problems given the mean field behavior of their average states and with leader's control taken as an exogenous stochastic process. We show that for each fixed policy of the leader, the followers' optimal decentralized local policies lead to an ???-Nash equilibrium, where ??? = O(1/???N). The paper then solves the leader's optimal control problem, as a constrained optimization problem, with the constraint being induced by the ???-Nash equilibrium policies of the followers (which depend on the leaders control as an exogenous process). We obtain the leaders optimal decentralized local control, which we subsequently show to constitute an O(1/???N)-approximate Stackelberg equilibrium for the original game. A numerical example included in the paper illustrates the theoretical results.
机译:我们考虑一类具有Stackelberg游戏模式的随机差分游戏,其中有一个领导者和N个统一的跟随者(其中N足够大),其中每个玩家都有自己的局部控制动力和二次成本函数,并且玩家之间具有耦合通过成本函数。特别地,领导者的成本函数具有跟随者的状态的平均值作为输入,并且除了直接受到领导者的控制功能的影响之外,每个跟随者的成本函数具有相似的项。因此,领导者通过控制策略来控制跟随者(玩纳什游戏)的行为。因此,这类随机差分博弈很难分析和获得其Stackelberg-Nash解。为了避免这种困难,本文中的方法是将原始博弈嵌入一类平均场随机微分游戏中,在这种情况下,跟随者根据给定其平均状态的平均场行为并以领导者的控制为准来解决单个随机控制问题。外生的随机过程。我们表明,对于领导者的每项固定政策,追随者的最优分散地方政策都会导致???-Nash均衡,其中??? = O(1 / ??? N)。然后,论文解决了作为约束优化问题的领导者的最优控制问题,其约束是由追随者的???-Nash均衡策略引起的(取决于领导者的控制是一个外生过程)。我们获得领导者最优分散的局部控制,随后我们证明它构成了原始游戏的O(1 / ??? N)近似Stackelberg平衡。本文包含的一个数值示例说明了理论结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号