【24h】

Computing Stackelberg Equilibria in Discounted Stochastic Games

机译:折扣随机游戏中的Stackelberg均衡计算

获取原文

摘要

Stackelberg games increasingly influence security policies deployed in real-world settings. Much of the work to date focuses on devising a fixed randomized strategy for the defender, accounting for an attacker who optimally responds to it. In practice, defense policies are often subject to constraints and vary over time, allowing an attacker to infer characteristics of future policies based on current observations. A defender must therefore account for an attacker's observation capabilities in devising a security policy. We show that this general modeling framework can be captured using stochastic Stackelberg games (SSGs), where a defender commits to a dynamic policy to which the attacker devises an optimal dynamic response. We then offer the following contributions. 1) We show that Markov stationary policies suffice in SSGs, 2) present a finite-time mixed-integer non-linear program for computing a Stackelberg equilibrium in SSGs, and 3) present a mixed-integer linear program to approximate it. 4) We illustrate our algorithms on a simple SSG representing an adversarial patrolling scenario, where we study the impact of attacker patience and risk aversion on optimal defense policies.
机译:Stackelberg游戏越来越多地影响实际环境中部署的安全策略。迄今为止,大部分工作都集中在为防御者设计固定的随机策略上,以考虑对攻击者做出最佳响应的攻击者。在实践中,防御策略通常会受到约束,并且会随时间变化,从而使攻击者可以根据当前观察来推断未来策略的特征。因此,防御者必须在设计安全策略时考虑攻击者的观察能力。我们表明,可以使用随机Stackelberg游戏(SSG)捕获此通用建模框架,在该游戏中,防御者会遵循动态策略,攻击者会针对该策略制定最佳动态响应。然后,我们提供以下内容。 1)我们证明了马尔科夫平稳策略在SSG中就足够了; 2)提出了用于计算SSG中Stackelberg平衡的有限时间混合整数非线性程序,并且3)提出了近似的混合整数线性程序。 4)我们在一个简单的表示敌对巡逻场景的SSG上说明了算法,在此我们研究了攻击者的耐心和风险规避对最佳防御策略的影响。

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号