首页> 外文会议>International conference on learning and intelligent optimization >Improving the Exploration in Upper Confidence Trees
【24h】

Improving the Exploration in Upper Confidence Trees

机译:改进高置信度树的探索

获取原文
获取外文期刊封面目录资料

摘要

In the standard version of the UCT algorithm, in the case of a continuous set of decisions, the exploration of new decisions is done through blind search. This can lead to very inefficient exploration, particularly in the case of large dimension problems, which often happens in energy management problems, for instance. In an attempt to use the information gathered through past simulations to better explore new decisions, we propose a method named Blind Value (BV). It only requires the access to a function that randomly draws feasible decisions. We also implement it and compare it to the original version of continuous UCT. Our results show that it gives a significant increase in convergence speed, in dimensions 12 and 80.
机译:在UCT算法的标准版本中,在连续的一组决策的情况下,通过盲目搜索来探索新决策。这可能会导致非常低效的探索,尤其是在大尺寸问题的情况下,例如在能源管理问题中经常发生的问题。为了尝试使用从过去的模拟中收集的信息来更好地探索新决策,我们提出了一种称为盲值(BV)的方法。它只需要访问随机得出可行决策的功能即可。我们还将实现它,并将其与连续UCT的原始版本进行比较。我们的结果表明,它在12和80维度上显着提高了收敛速度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号