首页> 外文会议>IEEE Conference on Games >Evaluating the Complexity of Players’ Strategies using MCTS Iterations

【24h】

Evaluating the Complexity of Players’ Strategies using MCTS Iterations

机译：使用MCTS迭代评估玩家策略的复杂性

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Monte Carlo Tree Search (MCTS) does not require any prior knowledge about a game to play, except for its legal moves and end conditions. Thus, the same MCTS player can be applied (almost) as it is to a wide variety of games. Accordingly, MCTS may be used as a touchstone to evaluate artificial players on different games. In this paper, we propose to use MCTS to qualitatively evaluate the strength of artificial players as the minimum number of iterations that MCTS needs to perform equivalently to the target player. We define this value as the 'MCTS complexity' of the target player. We introduce a bisection procedure to compute the MCTS complexity of a player and present experiments to evaluate the proposed approach on three games: Connect4, Awari, and Othello. Initially, we apply our approach to compute the MCTS complexity of players implemented using MCTS with a known number of iterations, next to players using different strategies. Our preliminary results show that our approach can identify the number of iterations used by MCTS target players. When applied to players implementing unknown strategies, it produces results that are coherent with the underlying players’ strength, assigning higher values of MCTS complexity to stronger players. Our results also suggest that, by using iterations to evaluate the strength of players, we may be able to compare the strength of algorithms that would be incomparable in practice (e.g. a greedy strategy for Connect4 and alpha-beta pruning for Awari).

机译：蒙特卡洛树搜索（MCTS）不需要任何有关玩游戏的先验知识，除了其合法举动和最终条件外。因此，相同的MCTS播放器可以（几乎）应用于各种游戏。因此，MCTS可以用作评估不同游戏上的人工玩家的试金石。在本文中，我们建议使用MCTS来定性评估人工玩家的实力，作为MCTS与目标玩家等效执行所需的最小迭代次数。我们将此值定义为目标参与者的“ MCTS复杂度”。我们引入了两等分程序来计算玩家的MCTS复杂度，并提供实验以评估针对以下三种游戏的建议方法：Connect4，Awari和Othello。最初，我们使用方法来计算使用已知迭代次数的MCTS实现的播放器的MCTS复杂度，其次是使用不同策略的播放器。我们的初步结果表明，我们的方法可以确定MCTS目标参与者使用的迭代次数。当将其应用于实施未知策略的参与者时，其产生的结果与潜在参与者的实力相一致，从而将较高的MCTS复杂度值分配给实力较强的参与者。我们的结果还表明，通过使用迭代来评估玩家的实力，我们也许能够比较实践中无法比拟的算法的实力（例如，Connect4的贪婪策略和Awari的alpha-beta修剪）。

著录项

来源
《IEEE Conference on Games》|2019年|1-8|共8页
会议地点 London(GB)
作者
Pier Luca Lanzi;
展开▼
作者单位

Dipartimento di Elettronica Informazione e Bioingegneria Politecnico di Milano;

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Games; Complexity theory; Monte Carlo methods; Uncertainty; Reliability; Upper bound; Law;

机译：游戏；复杂性理论；蒙特卡洛方法；不确定;可靠性;上限；法;
入库时间 2022-08-26 14:42:22

相似文献

外文文献
中文文献
专利

1. Complexity analysis of remanufacturing duopoly game with different competition strategies and heterogeneous players [J] . Shi Lian, Sheng Zhaohan, Xu Feng Nonlinear dynamics . 2015,第3期

机译：具有不同竞争策略和异类参与者的再制造双寡头博弈的复杂性分析
2. Autocratic strategies for infinitely iterated multiplayer social dilemma games ? [J] . E. Martirosyan, A. Govaert, M. Cao IFAC PapersOnLine . 2020,第2期

机译：无限迭代多人社交困境游戏的专制策略？
3. Zero-determinant Strategies for Multi-player Multi-action Iterated Games [J] . Xiaofan He, Huaiyu Dai, Peng Ning, IEEE signal processing letters . 2016,第3期

机译：多人多动作迭代游戏的零决定策略
4. Evaluating the Complexity of Players’ Strategies using MCTS Iterations [C] . Pier Luca Lanzi IEEE Conference on Games . 2019

机译：使用MCTS迭代评估玩家策略的复杂性
5. Design of protection and control strategies for low-loss MCT power converters. [D] . Quek, Danny. 1994

机译：低损耗MCT电源转换器的保护和控制策略设计。
6. ITERATIVE EVALUATION IN A MOBILE COUNSELING AND TESTING PROGRAM TO REACH PEOPLE OF COLOR AT RISK FOR HIV—NEW STRATEGIES IMPROVE PROGRAM ACCEPTABILITY EFFECTIVENESS AND EVALUATION CAPABILITIES [O] . Freya Spielberg, Ann Kurth, William Reidy, -1

机译：迭代评估一个移动咨询和颜色在危险中为HIV-NEW策略的测试程序以达人提高程序的可接受性有效性并评估能力
7. The consequences of switching strategies in a two-player iterated survival game [O] . Olivier Salagnac, John Wakeley 2021

机译：交换策略在双人迭代生存游戏中的后果

Evaluating the Complexity of Players’ Strategies using MCTS Iterations

摘要

著录项

相似文献

相关主题

期刊订阅