首页> 外文会议>IEEE Conference on Computational Intelligence and Games >Comparison of rapid action value estimation variants for general game playing
【24h】

Comparison of rapid action value estimation variants for general game playing

机译:普通游戏的快速动作值估计变量的比较

获取原文

摘要

General Game Playing (GGP) aims at creating computer programs able to play any arbitrary game at an expert level given only its rules. The lack of game-specific knowledge and the necessity of learning a strategy online have made Monte-Carlo Tree Search (MCTS) a suitable method to tackle the challenges of GGP. An efficient search-control mechanism can substantially increase the performance of MCTS. The RAVE strategy and its more recent variant, GRAVE, have been proposed for this reason. In this paper we further investigate the use of GRAVE for GGP and compare its performance with the more established RAVE strategy and with a new variant, called HRAVE, that uses more global information. Experiments show that for some games GRAVE and HRAVE perform better than RAVE, with GRAVE being the most promising one overall.
机译:通用游戏(GGP)的目的是创建能够在仅给出规则的情况下以专家级别玩任意游戏的计算机程序。缺乏特定于游戏的知识以及在线学习策略的必要性,使得蒙特卡洛树搜索(MCTS)成为应对GGP挑战的合适方法。有效的搜索控制机制可以大大提高MCTS的性能。由于这个原因,提出了RAVE策略及其最近的变体GRAVE。在本文中,我们将进一步研究GRAVE在GGP中的使用,并将其性能与更成熟的RAVE策略以及使用更多全球信息的新版本HRAVE进行比较。实验表明,对于某些游戏,GRAVE和HRAVE的性能要优于RAVE,其中GRAVE是最有前途的游戏。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号