Guiding Combinatorial Optimization with UCT

Abstract

We propose a new approach for search tree exploration in the context of combinatorial optimization, specifically Mixed Integer Programming (MIP), that is based on UCT, an algorithm for the multi-armed bandit problem designed for balancing exploration and exploitation in an online fashion. UCT has recently been highly successful in game tree search. We discuss the differences that arise when UCT is applied to search trees as opposed to bandits or game trees, and provide initial results demonstrating that the performance of even a highly optimized state-of-the-art MIP solver such as CPLEX can be boosted using UCT's guidance on a range of problem instances.
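As a rough illustration of the exploration/exploitation balance mentioned above, the following is a minimal sketch of the standard UCB1-style selection rule commonly used in UCT. The abstract does not give the scoring rule the authors actually use for MIP search trees, so the function names, the tuple layout, and the constant `c` here are assumptions for illustration only, not the paper's method.

```python
import math

def uct_score(mean_reward, visits, parent_visits, c=math.sqrt(2)):
    """Generic UCB1-style score: average reward plus an exploration bonus.

    This is the textbook bandit formula, assumed here for illustration;
    it is not taken from the paper.
    """
    if visits == 0:
        return math.inf  # unvisited children are always tried first
    return mean_reward + c * math.sqrt(math.log(parent_visits) / visits)

def select_child(children, c=math.sqrt(2)):
    """Pick the child with the highest score.

    children: list of (name, mean_reward, visits) tuples (hypothetical layout).
    """
    parent_visits = sum(visits for _, _, visits in children)
    best = max(children, key=lambda ch: uct_score(ch[1], ch[2], parent_visits, c))
    return best[0]
```

With `c = 0` the rule is purely greedy on the average reward. A larger `c` shifts visits toward rarely explored children, so `select_child([("left", 0.6, 10), ("right", 0.5, 2)])` favors the less-visited `"right"` branch under the default constant.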
