首页> 外文会议>International conference on decision and game theory for security >Adaptive Regret Minimization in Bounded-Memory Games
【24h】

Adaptive Regret Minimization in Bounded-Memory Games

机译:界记记忆游戏中的自适应遗憾最小化

获取原文

摘要

Organizations that collect and use large volumes of personal information often use security audits to protect data subjects from inappropriate uses of this information by authorized insiders. In face of unknown incentives of employees, a reasonable audit strategy for the organization is one that minimizes its regret. While regret minimization has been extensively studied in repeated games, the standard notion of regret for repeated games cannot capture the complexity of the interaction between the organization (defender) and an adversary, which arises from dependence of rewards and actions on history. To account for this generality, we introduce a richer class of games called bounded-memory games, which can provide a more accurate model of the audit process. We introduce the notion of k-adaptive regret, which compares the reward obtained by playing actions prescribed by the algorithm against a hypothetical k-adaptive adversary with the reward obtained by the best expert in hindsight against the same adversary. Roughly, a hypothetical k-adaptive adversary adapts her strategy to the defender's actions exactly as the real adversary would within each window of k rounds. A k-adaptive adversary is a natural model for temporary adversaries (e.g., company employees) who stay for a certain number of audit cycles and are then replaced by a different person. Our definition is parameterized by a set of experts, which can include both fixed and adaptive defender strategies. We investigate the inherent complexity of and design algorithms for adaptive regret minimization in bounded-memory games of perfect and imperfect information. We prove a hardness result showing that, with imperfect information, any k-adaptive regret minimizing algorithm (with fixed strategies as experts) must be inefficient unless NP = RP even when playing against an oblivious adversary. In contrast, for bounded-memory games of perfect and imperfect information we present approximate 0-adaptive regret minimization algorithms against an oblivious adversary running in time n~0(1).
机译:收集和使用大量个人信息的组织经常使用安全审计来保护不适当使用这些信息的数据受试者通过授权的内部人员。面对员工的未知激励,该组织的合理审计战略是最小化其遗憾的审计策略。虽然在重复的游戏中广泛研究了遗憾最小化,但重复游戏的遗憾的标准概念不能捕捉组织(后卫)和对手之间互动的复杂性,这产生了奖励和历史行动的依赖。要考虑这一普遍性,我们介绍了一个富裕的游戏,称为有限记忆游戏,可以提供更准确的审计过程模型。我们介绍了K-Adaptive遗憾的概念,它比较了通过施用算法规定的行动而获得的奖励,并通过最佳专家对同样对手的最佳专家获得的奖励来扮演假设的K-Adviceivera。粗略地,假设的K-Adaptive Funersary将她的策略适应后卫的行为,就像真正的对手将在K轮的每个窗口内一样。 K-Adaptive Funersary是临时对手(例如,公司雇员)的自然模型,他们留在一定数量的审计周期,然后被不同的人所取代。我们的定义由一组专家参数化,该专家可以包括固定和自适应后卫策略。我们研究了完美和不完美信息的界限记忆游戏中自适应遗憾最小化的固有复杂性。我们证明了一个硬度结果,表明,具有不完美的信息,任何K-Adaptive遗憾最小化算法(以固定策略为专家),除非NP = RP即使在对抗一个令人沮丧的对手时也是效率。相比之下,对于完美和不完美信息的有界记忆游戏,我们呈现近似的0自适应遗体最小化算法,反对在时间n〜0(1)的令人沮丧的逆境中。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号