【24h】

Improving Branch-and-Bound Using Decision Diagrams and Reinforcement Learning

机译:使用决策图和强化学习改善分支和束缚

获取原文

摘要

Combinatorial optimization has found applications in numerous fields, from transportation to scheduling and planning. The goal is to find an optimal solution among a finite set of possibilities. Most exact approaches use relaxations to derive bounds on the objective function, which are embedded within a branch-and-bound algorithm. Decision diagrams provide a new approach for obtaining bounds that, in some cases, can be significantly better than those obtained with a standard linear programming relaxation. However, it is known that the quality of the bounds achieved through this bounding method depends on the ordering of variables considered for building the diagram. Recently, a deep reinforcement learning approach was proposed to compute a high-quality variable ordering. The bounds obtained exhibited improvements, but the mechanism proposed was not embedded in a branch-and-bound solver. This paper proposes to integrate learned optimization bounds inside a branch-and-bound solver, through the combination of reinforcement learning and decision diagrams. The results obtained show that the bounds can reduce the tree search size by a factor of at least three on the maximum independent set problem.
机译:组合优化在众多领域中发现了应用,从运输到调度和规划。目标是在有限的可能性中找到最佳解决方案。最精确的方法使用放宽来导出目标函数的界限,这些函数嵌入到分支和绑定算法中。判定图提供了一种用于获得界限的新方法,在某些情况下可以明显优于用标准线性编程松弛获得的界限。然而,已知通过该限定方法实现的界限的质量取决于考虑构建图表的变量的排序。最近,提出了一种深入的加强学习方法来计算高质量的变量排序。所获得的界限表现出改进,但提出的机制并不嵌入分支和结合的求解器中。本文通过加强学习和决策图的组合将学习优化范围集成在分支和绑定的求解器内。获得的结果表明,在最大独立的设置问题上,界限可以将树搜索大小减少至少三个。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号