Computing Stackelberg Equilibria in Discounted Stochastic Games

机译：折扣随机游戏中的Stackelberg均衡计算

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Stackelberg games increasingly influence security policies deployed in real-world settings. Much of the work to date focuses on devising a fixed randomized strategy for the defender, accounting for an attacker who optimally responds to it. In practice, defense policies are often subject to constraints and vary over time, allowing an attacker to infer characteristics of future policies based on current observations. A defender must therefore account for an attacker's observation capabilities in devising a security policy. We show that this general modeling framework can be captured using stochastic Stackelberg games (SSGs), where a defender commits to a dynamic policy to which the attacker devises an optimal dynamic response. We then offer the following contributions. 1) We show that Markov stationary policies suffice in SSGs, 2) present a finite-time mixed-integer non-linear program for computing a Stackelberg equilibrium in SSGs, and 3) present a mixed-integer linear program to approximate it. 4) We illustrate our algorithms on a simple SSG representing an adversarial patrolling scenario, where we study the impact of attacker patience and risk aversion on optimal defense policies.

机译：Stackelberg游戏越来越多地影响实际环境中部署的安全策略。迄今为止，大部分工作都集中在为防御者设计固定的随机策略上，以考虑对攻击者做出最佳响应的攻击者。在实践中，防御策略通常会受到约束，并且会随时间变化，从而使攻击者可以根据当前观察来推断未来策略的特征。因此，防御者必须在设计安全策略时考虑攻击者的观察能力。我们表明，可以使用随机Stackelberg游戏（SSG）捕获此通用建模框架，在该游戏中，防御者会遵循动态策略，攻击者会针对该策略制定最佳动态响应。然后，我们提供以下内容。 1）我们证明了马尔科夫平稳策略在SSG中就足够了; 2）提出了用于计算SSG中Stackelberg平衡的有限时间混合整数非线性程序，并且3）提出了近似的混合整数线性程序。 4）我们在一个简单的表示敌对巡逻场景的SSG上说明了算法，在此我们研究了攻击者的耐心和风险规避对最佳防御策略的影响。

著录项

来源
《IAAI-12;Innovative applications of artificial intelligence conference;AAAI conference on artificial intelligence;Symposium on educational advances in artificial intelligence;AAAI-12;EAAI-12》|2012年|p.1478-1484|共7页
会议地点
作者
Yevgeniy Vorobeychik; Satinder Singh;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类人工智能理论;人工智能理论;
关键词

相似文献

外文文献
中文文献
专利

1. Self-consistent Feedback Stackelberg Equilibria for Infinite Horizon Stochastic Games [J] . Dynamic games and applications . 2020,第2期

机译：自我一致的反馈Stackelberg equilibria用于无限的地平线随机游戏
2. COMPUTING THE STACKELBERG/NASH EQUILIBRIA USING THE EXTRAPROXIMAL METHOD: CONVERGENCE ANALYSIS AND IMPLEMENTATION DETAILS FOR MARKOV CHAINS GAMES [J] . Trejo Kristal K., Clempner Julio B., Poznyak Alexander S. International Journal of Applied Mathematics and Computer Science . 2015,第2期

机译：使用近端方法计算STACKELBERG / NASH平衡：马尔可夫链游戏的收敛性分析和实现细节
3. Computing the Stackelberg/Nash Equilibria Using the Extraproximal Method: Convergence Analysis and Implementation Details for Markov Chains Games [J] . Kristal K. Trejo, Julio B. Clempner, Alexander S. Poznyak International journal of applied mathematics and computer science . 2015,第2期

机译：使用极近方法计算Stackelberg /纳什均衡：马尔可夫链游戏的收敛性分析和实现细节
4. Computing Stackelberg Equilibria in Discounted Stochastic Games [C] . Yevgeniy Vorobeychik, Satinder Singh Innovative applications of artificial intelligence conference . 2012

机译：计算Stackelberg折扣随机游戏的均衡
5. Real-time Load Balancing based on Stackelberg Game and Reinforcement Learning in Cloudlet Network [D] . Gu Zhiqiang 2020

机译：CloudStack网络中基于Stackelberg博弈和强化学习的实时负载均衡
6. Retailer Stackelberg game in a supply chain with pricing and service decisions and simple price discount contract [O] . Seyed Jafar Sadjadi, Hashem Asadi, Ramin Sadeghian, -1

机译：具有定价和服务决策以及简单的价格折扣合同的供应链中的零售商Stackelberg游戏
7. Stackelberg Equilibria for Discrete-Time Dynamic Games Part II: Stochastic Games with Deterministic Information Structure [O] . B. De Schutter, Bart De Schutter 2015

机译：离散时间动态博弈的stackelberg均衡第二部分：具有确定性信息结构的随机游戏

Computing Stackelberg Equilibria in Discounted Stochastic Games

摘要

著录项

相似文献

相关主题

期刊订阅