Improving the Exploration in Upper Confidence Trees

机译：改进高置信度树的探索

获取原文

获取外文期刊封面目录资料

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

In the standard version of the UCT algorithm, in the case of a continuous set of decisions, the exploration of new decisions is done through blind search. This can lead to very inefficient exploration, particularly in the case of large dimension problems, which often happens in energy management problems, for instance. In an attempt to use the information gathered through past simulations to better explore new decisions, we propose a method named Blind Value (BV). It only requires the access to a function that randomly draws feasible decisions. We also implement it and compare it to the original version of continuous UCT. Our results show that it gives a significant increase in convergence speed, in dimensions 12 and 80.

机译：在UCT算法的标准版本中，在连续的一组决策的情况下，通过盲目搜索来探索新决策。这可能会导致非常低效的探索，尤其是在大尺寸问题的情况下，例如在能源管理问题中经常发生的问题。为了尝试使用从过去的模拟中收集的信息来更好地探索新决策，我们提出了一种称为盲值（BV）的方法。它只需要访问随机得出可行决策的功能即可。我们还将实现它，并将其与连续UCT的原始版本进行比较。我们的结果表明，它在12和80维度上显着提高了收敛速度。

著录项

来源
《International conference on learning and intelligent optimization》|2012年|366-371|共6页
会议地点
作者
Adrien Coueetoux; Hassen Doghmen; Olivier Teytaud;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Modification of improved upper confidence bounds for regulating exploration in Monte-Carlo tree search [J] . Liu Yun-Ching, Tsuruoka Yoshimasa Theoretical computer science . 2016,第Null期

机译：在蒙特卡洛树搜索中修改改进的置信上限以规范勘探
2. Improving confidence in tree species selection for challenging urban sites: a role for leaf turgor loss [J] . Sjoman H., Hirons A. D., Bassuk N. L. Urban ecosystems . 2018,第6期

机译：提高对具有挑战性的城市用地的树种选择的信心：叶膨大损失的作用
3. Hybridizing Rapidly Exploring Random Trees and Basin Hopping Yields an Improved Exploration of Energy Landscapes [J] . Roth Christine-Andrea, Dreyfus Tom, Robert Charles H., Journal of Computational Chemistry: Organic, Inorganic, Physical, Biological . 2016,第7a8期

机译：快速探索随机树和盆地跳跃的杂交产生了对能量景观的改进探索
4. Improving the Exploration in Upper Confidence Trees [C] . Adrien Cou?toux, Hassen Doghmen, Olivier Teytaud International Conference on Learning and Intelligent Optimization . 2012

机译：提高上置信树的探索
5. Methane Flux of Tree Stems And Mitigating the Impacts of Insect Outbreak Through Planting Alternative Tree Species Within the Upper Great Lakes Region, USA [D] . Bolton, Nicholas W. 2017

机译：通过在美国大湖区上游种植替代树种来减少树木茎的甲烷通量并减轻昆虫暴发的影响
6. EXPLORATION OF SCORE AGREEMENT ON A MODIFIED UPPER QUARTER Y-BALANCE TEST KIT AS COMPARED TO THE UPPER QUARTER Y-BALANCE TEST [O] . Josh Cramer, Miguel Quintero, Alex Rhinehart, 2017

机译：与上季度Y平衡测试比较的经修改的上季度Y平衡测试套件的评分协议的探索
7. Improving the exploration in Upper Confidence Trees [O] . Adrien Couëtoux, Hassen Doghmen, Olivier Teytaud 2012

机译：改进高置信度树的探索
8. DETECT: DEpendency Tree Evaluation and Confidence Test [R] . Cooke, R., van Noortwijk, J., Waij, R. 1988

机译：DETECT：依赖树评估和置信度测试

Improving the Exploration in Upper Confidence Trees

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅