Improved LinUCT and its evaluation on incremental random-feature tree

机译：改进的LinUCT及其对增量随机特征树的评估

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

UCT is a standard method of Monte Carlo tree search (MCTS) algorithms, which have been applied to various domains and have achieved remarkable success. This study proposes a family of Leaf-LinUCT, which are improved LinUCT algorithms incorporating LinUCB into MCTS. LinUCB outperforms UCB1 in contextual multi-armed bandit problems, owing to a kind of online learning with ridge regression. However, due to the minimax structure of game trees, ridge regression in LinUCB does not always work well in the context of tree search. In this paper, we remedy the problem and extend our previous work on LinUCT in two ways: (1) by restricting teacher data for regression to the frontier nodes in a current search tree, and (2) by adjusting the feature vector of each internal node to the weighted mean of the feature vector of the descendant nodes. We also present a new synthetic model, incremental-random-feature tree, by extending the standard incremental random tree model. In our model, each node has a feature vector that represents the characteristics of the corresponding position. The elements of a feature vector in a node are randomly changed from those in its parent node by each move, as the heuristic score of a node is randomly changed by each move in the standard incremental random tree model. The experimental results show that our Leaf-LinUCT outperformed UCT and existing LinUCT algorithms, in the incremental-random-feature tree and a synthetic game studied in [1].

机译：UCT是蒙特卡罗树搜索（MCTS）算法的标准方法，已应用于各个领域并取得了显著成功。这项研究提出了Leaf-LinUCT系列，这是将LinUCB集成到MCTS中的改进的LinUCT算法。 LinUCB在上下文多臂强盗问题上的表现优于UCB1，这是由于具有岭回归的一种在线学习。但是，由于游戏树的极大极小结构，LinUCB中的岭回归在树搜索的上下文中并不总是能很好地起作用。在本文中，我们对问题进行了补救，并以两种方式扩展了我们在LinUCT上的工作：（1）通过将教师数据限制为回归到当前搜索树中的前沿节点，以及（2）通过调整每个内部的特征向量节点到后代节点的特征向量的加权平均值。通过扩展标准的增量随机树模型，我们还提出了一种新的综合模型，即增量随机特征树。在我们的模型中，每个节点都有一个代表相应位置特征的特征向量。节点中特征向量的元素通过每次移动而与其父节点中的特征向量的元素随机变化，因为在标准增量随机树模型中，节点的启发式分数通过每次移动随机地变化。实验结果表明，在增量随机特征树和合成博弈中，我们的Leaf-LinUCT性能优于UCT和现有的LinUCT算法[1]。

著录项

来源
《IEEE Conference on Computational Intelligence and Games》|2016年|1-8|共8页
会议地点
作者
Yusaku Mandai; Tomoyuki Kaneko;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Games; Prediction algorithms; Monte Carlo methods; Standards; Regression tree analysis; Context; Estimation;

机译：游戏;预测算法; Monte Carlo方法;标准;回归树分析;上下文;估计;

相似文献

外文文献
中文文献
专利

1. Evaluation of Whole Tree Growth Increment Derived from Tree-Ring Series for Use in Assessments of Changes in Forest Productivity across Various Spatial Scales [J] . Juha M. Metsaranta, Jagtar S. Bhatti Forests . 2016,第12期

机译：评估从树环系列得出的整棵树的生长增量，用于评估各种空间尺度上森林生产力的变化
2. Constrained incremental tree building: new absolute fast converging phylogeny estimation methods with improved scalability and accuracy [J] . Qiuyi Zhang, Satish Rao, Tandy Warnow Algorithms for Molecular Biology . 2019,第1期

机译：约束增量树构建：具有改进的可伸缩性和准确性的新的绝对快速收敛系统发育估计方法
3. An Improved Algorithm for Incremental DFS Tree in Undirected Graphs [J] . Lijie Chen, Ran Duan, Ruosong Wang, LIPIcs : Leibniz International Proceedings in Informatics . 2018,第2期

机译：无向图中增量DFS树的一种改进算法
4. Improved LinUCT and its evaluation on incremental random-feature tree [C] . Yusaku Mandai, Tomoyuki Kaneko IEEE Conference on Computational Intelligence and Games . 2016

机译：改进incuct in in增量随机特征树的评估
5. Evaluating and improving Collection Tree Protocol in Mobile Wireless Sensor Network. [D] . Sharma, Dixit. 2011

机译：评估和改进移动无线传感器网络中的收集树协议。
6. Constrained incremental tree building: new absolute fast converging phylogeny estimation methods with improved scalability and accuracy [O] . Qiuyi Zhang, Satish Rao, Tandy Warnow 2019

机译：约束增量树构建：具有改进的可扩展性和准确性的新的绝对快速收敛系统发育估计方法
7. A Novel Approach to Evaluate the Effect of Neighboring Trees and the Orientation of Tree Social Area on Stem Radial Increment of Norway Spruce Trees [O] . Jan Světlík, Jan Krejza, Pavel Bednář 2021

机译：一种评价邻近树木与树社会区域方向对挪威云杉树木径向增量影响的新方法

Improved LinUCT and its evaluation on incremental random-feature tree

摘要

著录项

相似文献

相关主题

期刊订阅