【24h】

Learning Opening Strategy in the Game of Go

机译:在围棋游戏中学习开放策略

获取原文
获取原文并翻译 | 示例

摘要

In this paper, we present an experimental methodology and results for a machine learning approach to learning opening strategy in the game of Go, a game for which the best computer programs play only at the level of an advanced beginning human player. While the evaluation function in most computer Go programs consists of a carefully crafted combination of pattern matchers, expert rules, and selective search, we employ a neural network trained by self-play using temporal difference learning. Our focus is on the sequence of moves made at the beginning of the game. Experimental results indicate that our approach is effective for learning opening strategy, and they also identify higher-level features of the game that improve the quality of the learned evaluation function.
机译:在本文中,我们介绍了一种机器学习方法的实验方法和结果,该方法用于学习围棋游戏中的开放策略,该游戏中最好的计算机程序只能在高级入门人类玩家的水平上玩。虽然大多数计算机Go程序中的评估功能都是精心组合的模式匹配器,专家规则和选择性搜索组成的,但我们采用了通过时差学习进行自我游戏训练的神经网络。我们的重点是游戏开始时的动作顺序。实验结果表明,我们的方法对于学习开放策略是有效的,并且还可以识别游戏的高级功能,从而提高学习的评估功能的质量。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号