RankNet for evaluation functions of the game of Go

Mandai Yusaku; Kaneko Tomoyuki

首页> 外文期刊>ICGA journal >RankNet for evaluation functions of the game of Go

【24h】

RankNet for evaluation functions of the game of Go

机译：RANKNET用于游戏的评估功能

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper, we present a new algorithm for learning evaluation functions of the game of Go. Recently AlphaGo Zero and AlphaZero have shown that accurate evaluation functions can be constructed by using deep neural networks. Such a training, however, requires an enormous amount of computational resources that are not available for most researchers. One of the next challenges in this domain is constructing accurate evaluation functions with lesser computational resources. To tackle this problem, we apply the RankNet algorithm to training an AlphaGo Zero style unified Policy and Value network in a learning-to-rank fashion. Using the pairwise RankNet training increases the potential number of training examples and alleviates the requirements for the number of game records. Our modified RankNet algorithm trains both value and policy losses and its joint training makes the learning stable. Experimental results showed that neural networks trained by our algorithm showed higher playing strength than other methods, especially when the dataset sizes were relatively limited.

机译：在本文中，我们提出了一种新的算法，用于了解游戏的学习评估功能。最近，alphago Zero和Alphazero表明，可以通过使用深神经网络来构建准确的评估功能。然而，这种培训需要大多数研究人员不可用的巨大计算资源。此域中的下一个挑战之一是构造具有较小的计算资源的准确评估功能。为了解决这个问题，我们将rancyNet算法应用于培训alphano零样式统一策略和价值网络，以学习 - 排名方式。使用成对校准培训培训增加了潜在的培训例子，并减轻了游戏记录数量的要求。我们改进的RankNet算法列举了价值和政策损失，其联合培训使学习稳定。实验结果表明，我们的算法训练的神经网络显示出比其他方法更高的播放强度，特别是当数据集大小相对有限时。

著录项

来源
《ICGA journal》 |2019年第2期|78-91|共14页
作者
Mandai Yusaku; Kaneko Tomoyuki;
展开▼
作者单位

Univ Tokyo Tokyo Japan;

Univ Tokyo Tokyo Japan;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. RankNet for evaluation functions of the game of Go [J] . Mandai Yusaku, Kaneko Tomoyuki ICGA journal . 2019,第2期

机译：RankNet用于Go游戏的评估功能
2. Using Chinese dark chess endgame databases to validate and fine-tune game evaluation functions [J] . Chang Hung-Jui, Chen Jr-Chang, Fan Gang-Yu, ICGA journal . 2018,第2期

机译：使用中国黑棋残局数据库验证和优化游戏评估功能
3. Heuristic Evaluation Functions for General Game Playing [J] . James E. Clune KI - Künstliche Intelligenz . 2011,第1期

机译：一般游戏的启发式评估功能
4. Learning of Evaluation Functions on Mini-Shogi Using Self-playing Game Records [C] . Masahiro Shioda, Takeshi Ito International Conference on Technologies and Applications of Artificial Intelligence . 2020

机译：使用自助游戏记录了解迷你唱片的评估功能
5. Part I: Synthesis of Functionalized Bile Acids Towards the Construction of Steroidal Macrocycles. Part II: Evaluation and Expansion of a Modular Card Game for Teaching Organic Chemistry [D] . Knudtson, Christopher Anton. 2019

机译：第一部分：官能化胆酸的合成术语术语纯癌构建。第二部分：用于教学有机化学的模块化纸牌游戏的评估和扩展
6. Development and Evaluation of Maze-Like Puzzle Games to Assess Cognitive and Motor Function in Aging and Neurodegenerative Diseases [O] . Tobias Nef, Alvin Chesham, Narayan Schütz, 2020

机译：迷宫类益智游戏的开发和评估以评估衰老和神经退行性疾病的认知和运动功能
7. Evaluating monotone Boolean functions and game trees in the priced information model [O] . Milanič Martin 2013

机译：在价格信息模型中评估单调布尔函数和游戏树

RankNet for evaluation functions of the game of Go

摘要

著录项

相似文献

相关主题

期刊订阅