【24h】

Reinforcement Learning Estimation of Distribution Algorithm

机译:分布算法的强化学习估计

获取原文
获取原文并翻译 | 示例

摘要

This paper proposes an algorithm for combinatorial optimizations that uses reinforcement learning and estimation of joint probability distribution of promising solutions to generate a new population of solutions. We call it Reinforcement Leaxning Estimation of Distribution Algorithm (RELEDA). For the estimation of the joint probability distribution we consider each variable as univariate. Then we update the probability of each variable by applying reinforcement learning method. Though we consider variables independent of one another, the proposed method can solve problems of highly correlated variables. To compare the efficiency of our proposed algorithm with other Estimation of Distribution Algorithms (EDAs) we provide the experimental results of the two problems: four peaks problem and bipolar function.
机译:本文提出了一种用于组合优化的算法,该算法使用强化学习和对有前途解决方案的联合概率分布进行估计,以生成新的解决方案总量。我们称其为分配算法的强化宽松估计(RELEDA)。为了估计联合概率分布,我们将每个变量视为单变量。然后我们通过应用强化学习方法来更新每个变量的概率。尽管我们认为变量彼此独立,但所提出的方法可以解决高度相关变量的问题。为了将我们提出的算法与其他分布估计算法(EDA)的效率进行比较,我们提供了两个问题的实验结果:四个峰问题和双极函数。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号