Improving Branch-and-Bound Using Decision Diagrams and Reinforcement Learning

机译：使用决策图和强化学习改善分支和束缚

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Combinatorial optimization has found applications in numerous fields, from transportation to scheduling and planning. The goal is to find an optimal solution among a finite set of possibilities. Most exact approaches use relaxations to derive bounds on the objective function, which are embedded within a branch-and-bound algorithm. Decision diagrams provide a new approach for obtaining bounds that, in some cases, can be significantly better than those obtained with a standard linear programming relaxation. However, it is known that the quality of the bounds achieved through this bounding method depends on the ordering of variables considered for building the diagram. Recently, a deep reinforcement learning approach was proposed to compute a high-quality variable ordering. The bounds obtained exhibited improvements, but the mechanism proposed was not embedded in a branch-and-bound solver. This paper proposes to integrate learned optimization bounds inside a branch-and-bound solver, through the combination of reinforcement learning and decision diagrams. The results obtained show that the bounds can reduce the tree search size by a factor of at least three on the maximum independent set problem.

机译：组合优化在众多领域中发现了应用，从运输到调度和规划。目标是在有限的可能性中找到最佳解决方案。最精确的方法使用放宽来导出目标函数的界限，这些函数嵌入到分支和绑定算法中。判定图提供了一种用于获得界限的新方法，在某些情况下可以明显优于用标准线性编程松弛获得的界限。然而，已知通过该限定方法实现的界限的质量取决于考虑构建图表的变量的排序。最近，提出了一种深入的加强学习方法来计算高质量的变量排序。所获得的界限表现出改进，但提出的机制并不嵌入分支和结合的求解器中。本文通过加强学习和决策图的组合将学习优化范围集成在分支和绑定的求解器内。获得的结果表明，在最大独立的设置问题上，界限可以将树搜索大小减少至少三个。

著录项

来源
《International conference on integration of constraint programming, artificial intelligence, and operations research》|2021年|446-455|共10页
会议地点
作者
Augustin Parjadis; Quentin Cappart; Louis-Martin Rousseau; David Bergman;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Decision diagrams; Branch-and-bound; Reinforcement learning;

机译：决策图;分支和束缚;加强学习;

相似文献

外文文献
中文文献
专利

1. Reinforcement, Rationality, and Intentions: How Robust Is Automatic Reinforcement Learning in Economic Decision Making? [J] . Huegelschaefer Sabine, Achtziger Anja Journal of Behavioral Decision Making . 2017,第4期

机译：强化，合理性和意图：自动强化学习在经济决策中的稳健性如何？
2. A reinforcement learning diffusion decision model for value-based decisions [J] . Fontanesi Laura, Gluth Sebastian, Spektor Mikhail S., Psychonomic bulletin & review . 2019,第4期

机译：基于价值的决策的加强学习扩散决策模型
3. Anatomy of a Decision: Striato-Orbitofrontal Interactions in Reinforcement Learning, Decision Making, and Reversal [J] . Michael J. Frank, Eric D. Claus Psychological Review . 2006,第2期

机译：决策剖析：强化学习，决策制定和逆转中的纹状体-眶额相互作用
4. Improving Optimization Bounds Using Machine Learning: Decision Diagrams Meet Deep Reinforcement Learning [C] . Quentin Cappart, Emmanuel Goutierre, David Bergman, AAAI Conference on Artificial Intelligence . 2019

机译：使用机器学习改善优化范围：决策图符合深度增强学习
5. Simulation-Based Optimization and Reinforcement Learning Methods to Improve Decision Making in Agriculture [D] . Moeinizade, Saba. 2021

机译：基于模拟的优化和加固学习方法，提高农业决策
6. An extended reinforcement learning model of basal ganglia to understand the contributions of serotonin and dopamine in risk-based decision making reward prediction and punishment learning [O] . Pragathi P. Balasubramani, V. Srinivasa Chakravarthy, Balaraman Ravindran, 2014

机译：扩展的基底神经节强化学习模型以了解5-羟色胺和多巴胺在基于风险的决策奖励预测和惩罚学习中的作用
7. Improving Optimization Bounds Using Machine Learning: Decision Diagrams Meet Deep Reinforcement Learning [O] . Quentin Cappart, Emmanuel Goutierre, David Bergman, 2019

机译：使用机器学习改善优化界限：决策图符合深度增强学习

Improving Branch-and-Bound Using Decision Diagrams and Reinforcement Learning

摘要

著录项

相似文献

相关主题

期刊订阅