RankNet for evaluation functions of the game of Go

Mandai Yusaku; Kaneko Tomoyuki

首页> 外文期刊>ICGA journal >RankNet for evaluation functions of the game of Go

【24h】

RankNet for evaluation functions of the game of Go

机译：RankNet用于Go游戏的评估功能

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper, we present a new algorithm for learning evaluation functions of the game of Go. Recently AlphaGo Zero and AlphaZero have shown that accurate evaluation functions can be constructed by using deep neural networks. Such a training, however, requires an enormous amount of computational resources that are not available for most researchers. One of the next challenges in this domain is constructing accurate evaluation functions with lesser computational resources. To tackle this problem, we apply the RankNet algorithm to training an AlphaGo Zero style unified Policy and Value network in a learning-to-rank fashion. Using the pairwise RankNet training increases the potential number of training examples and alleviates the requirements for the number of game records. Our modified RankNet algorithm trains both value and policy losses and its joint training makes the learning stable. Experimental results showed that neural networks trained by our algorithm showed higher playing strength than other methods, especially when the dataset sizes were relatively limited.

机译：在本文中，我们提出了一种用于学习围棋游戏评估功能的新算法。最近，AlphaGo Zero和AlphaZero显示可以通过使用深度神经网络来构建准确的评估功能。但是，这种培训需要大量的计算资源，而这对于大多数研究人员而言是不可用的。该领域的下一个挑战是用较少的计算资源来构建准确的评估功能。为了解决这个问题，我们采用RankNet算法以按等级学习的方式训练AlphaGo零样式统一策略和价值网络。使用成对的RankNet训练可以增加训练示例的数量，并减轻对游戏记录数量的要求。我们改进的RankNet算法同时训练了价值损失和政策损失，其联合训练使学习稳定。实验结果表明，我们的算法训练的神经网络表现出比其他方法更高的播放强度，尤其是在数据集大小相对有限的情况下。

著录项

来源
《ICGA journal》 |2019年第2期|78-91|共14页
作者
Mandai Yusaku; Kaneko Tomoyuki;
展开▼
作者单位

Univ Tokyo, Tokyo, Japan;

Univ Tokyo, Tokyo, Japan;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. RankNet for evaluation functions of the game of Go [J] . Mandai Yusaku, Kaneko Tomoyuki ICGA journal . 2019,第2期

机译：RANKNET用于游戏的评估功能
2. Using Chinese dark chess endgame databases to validate and fine-tune game evaluation functions [J] . Chang Hung-Jui, Chen Jr-Chang, Fan Gang-Yu, ICGA journal . 2018,第2期

机译：使用中国黑棋残局数据库验证和优化游戏评估功能
3. Heuristic Evaluation Functions for General Game Playing [J] . James E. Clune KI - Künstliche Intelligenz . 2011,第1期

机译：一般游戏的启发式评估功能
4. Learning of Evaluation Functions on Mini-Shogi Using Self-playing Game Records [C] . Masahiro Shioda, Takeshi Ito International Conference on Technologies and Applications of Artificial Intelligence . 2020

机译：使用自助游戏记录了解迷你唱片的评估功能
5. Part I: Synthesis of Functionalized Bile Acids Towards the Construction of Steroidal Macrocycles. Part II: Evaluation and Expansion of a Modular Card Game for Teaching Organic Chemistry [D] . Knudtson, Christopher Anton. 2019

机译：第一部分：官能化胆酸的合成术语术语纯癌构建。第二部分：用于教学有机化学的模块化纸牌游戏的评估和扩展
6. Development and Evaluation of Maze-Like Puzzle Games to Assess Cognitive and Motor Function in Aging and Neurodegenerative Diseases [O] . Tobias Nef, Alvin Chesham, Narayan Schütz, 2020

机译：迷宫类益智游戏的开发和评估以评估衰老和神经退行性疾病的认知和运动功能
7. Evaluating monotone Boolean functions and game trees in the priced information model [O] . Milanič Martin 2013

机译：在价格信息模型中评估单调布尔函数和游戏树

RankNet for evaluation functions of the game of Go

摘要

著录项

相似文献

相关主题

期刊订阅