
Mixed Strategy Extraction from UCT Tree in Security Games

Abstract

This paper proposes a simulation-based approach to finding an optimal defender strategy in multi-act Security Games (SG) played on a graph. The method employs the Upper Confidence Bounds applied to Trees (UCT) algorithm, which relies on massive simulation of possible game scenarios. Three variants of the algorithm are presented and compared with each other and with the exact Mixed Integer Linear Program (MILP) solution in terms of computational efficiency and memory requirements. Experimental evaluation shows that the method needs a few times less memory and is faster than the MILP approach in the majority of test cases, while preserving the quality of the resulting mixed strategies.
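
To make the two ingredients named in the abstract concrete, the following is a minimal Python sketch, not the paper's method. It shows the standard UCB1 score that UCT uses to choose which child to simulate next, and one common way to read a mixed strategy off a tree node by normalizing child visit counts. The abstract does not say how its three variants extract the strategy, so the visit-count rule, the function names, and the exploration constant `c` are all assumptions made for illustration.

```python
import math


def ucb1_score(mean_reward, child_visits, parent_visits, c=math.sqrt(2)):
    """Standard UCB1 value for one child of a UCT node.

    The exploration constant c is an assumed default, not taken from the paper.
    """
    if child_visits == 0:
        # Unvisited children are tried before any visited one.
        return math.inf
    return mean_reward + c * math.sqrt(math.log(parent_visits) / child_visits)


def mixed_strategy_from_visits(visit_counts):
    """Turn child visit counts into a probability distribution over actions.

    Assumed extraction rule (hypothetical): probability proportional to visits.
    """
    total = sum(visit_counts.values())
    if total == 0:
        # No simulations yet: fall back to a uniform distribution.
        return {action: 1.0 / len(visit_counts) for action in visit_counts}
    return {action: n / total for action, n in visit_counts.items()}
```

For example, a node whose two defender actions were visited 30 and 10 times would give probabilities 0.75 and 0.25 under this assumed rule.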
