【24h】

A Self-Acquiring Knowledge Process for MCTS

机译:MCTS的自学知识过程

获取原文
获取原文并翻译 | 示例
           

摘要

MCTS (Monte Carlo Tree Search) is a well-known and efficient process to cover and evaluate a large range of states for combinatorial problems. We choose to study MCTS for the Computer Go problem, which is one of the most challenging problem in the field of Artificial Intelligence. For this game, a single combinatorial approach does not always lead to a reliable evaluation of the game states. In order to enhance MCTS ability to tackle such problems, one can benefit from game specific knowledge in order to increase the accuracy of the game state evaluation. Such a knowledge is not easy to acquire. It is the result of a constructivist learning mechanism based on the experience of the player. That is why we explore the idea to endow the MCTS with a process inspired by constructivist learning, to self-acquire knowledge from playing experience. In this paper, we propose a complementary process for MCTS called BHRF (Background History Reply Forest), which allows to memorize efficient patterns in order to promote their use through the MCTS process. Our experimental results lead to promising results and underline how self-acquired data can be useful for MCTS based algorithms.
机译:MCTS(蒙特卡罗树搜索)是一种众所周知的高效过程,可以涵盖和评估各种状态下的组合问题。我们选择针对MC问题研究MCTS,这是人工智能领域最具挑战性的问题之一。对于此游戏,单个组合方法并不总是导致对游戏状态的可靠评估。为了增强MCTS处理此类问题的能力,人们可以从游戏特定的知识中受益,从而提高游戏状态评估的准确性。这样的知识不容易获得。这是基于玩家经验的建构主义学习机制的结果。这就是为什么我们探索这种想法的原因,使MCTS具有受建构主义学习启发的过程,可以从演奏经验中自我获取知识。在本文中,我们提出了一种称为BHRF(背景历史回复森林)的MCTS补充流程,该流程可以记住有效的模式,以通过MCTS流程促进其使用。我们的实验结果带来了令人鼓舞的结果,并强调了自我获取的数据如何可用于基于MCTS的算法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号