首页> 外文期刊>Autonomous agents and multi-agent systems >A class of iterative refined Max-sum algorithms via non-consecutive value propagation strategies
【24h】

A class of iterative refined Max-sum algorithms via non-consecutive value propagation strategies

机译:一类基于非连续值传播策略的迭代优化Max-sum算法

获取原文
获取原文并翻译 | 示例
           

摘要

As an important technique to solve distributed constraint optimization problems, Max-sum has drawn a lot of attention and successfully been deployed in real applications. Unfortunately, Max-sum fails to converge in cyclic problems and usually traverses states with low quality. Max-sum_AD and Max-sum_ADVP were proposed to guarantee the single phase convergence and the cross phase convergence respectively, and greatly improve the solution quality of Max-sum. However, the solution quality is closely related to the timing for starting value propagation in Max-sum_ADVP. In other words, low-quality initial assignments will lead to a poor result. In this paper, we prove that value propagation could restrict the exploration ability brought by Max-sum and eventually makes Max-sum_ADVP equivalent to a sequential greedy local search algorithm. For getting a balance between exploration and exploitation, several non-consecutive value propagation strategies are proposed to relax the restriction caused by value propagation: single-side value propagation which executes value propagation and Max-sum_AD in an interleaved way, probabilistic value propagation which performs value propagation stochastically and hybrid belief/value propagation where agents perform Max-sum_AD and value propagation in one round. We illustrate that agents in our algorithms can make decisions beyond local functions. Our empirical evaluations demonstrate the superiority of our methods over Max-sum and its variants. It also can be found that our methods are independent of the value propagation timing which is a major concern in Max-sum_ADVP.
机译:作为解决分布式约束优化问题的一项重要技术,Max-sum引起了很多关注,并成功地部署在实际应用中。不幸的是,Max-sum无法收敛于循环问题,并且通常以低质量遍历状态。提出了Max-sum_AD和Max-sum_ADVP,分别保证了单相收敛和交叉相位收敛,大大提高了Max-sum的求解质量。但是,解决方案质量与Max-sum_ADVP中起始值传播的时间密切相关。换句话说,低质量的初始分配将导致较差的结果。在本文中,我们证明了值传播会限制Max-sum带来的探索能力,并最终使Max-sum_ADVP等效于顺序贪婪局部搜索算法。为了在勘探与开发之间取得平衡,提出了几种非连续的值传播策略来缓解由值传播引起的限制:单边值传播以交错方式执行值传播和Max-sum_AD,概率值传播以执行随机进行价值传播和混合信念/价值传播,其中代理在一个回合中执行Max-sum_AD和价值传播。我们说明了算法中的代理可以做出超出局部功能的决策。我们的经验评估表明,我们的方法优于Max-sum及其变体。还可以发现,我们的方法与值传播时序无关,这是Max-sum_ADVP中的主要问题。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号