Reinforcement Learning Estimation of Distribution Algorithm

Abstract

This paper proposes an algorithm for combinatorial optimization that uses reinforcement learning and estimation of the joint probability distribution of promising solutions to generate a new population of solutions. We call it Reinforcement Learning Estimation of Distribution Algorithm (RELEDA). To estimate the joint probability distribution, we treat each variable as univariate and then update the probability of each variable by applying a reinforcement learning method. Although we treat the variables as independent of one another, the proposed method can solve problems with highly correlated variables. To compare the efficiency of our proposed algorithm with that of other Estimation of Distribution Algorithms (EDAs), we provide experimental results on two problems: the four peaks problem and the bipolar function.
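The abstract does not state RELEDA's update rule, so the following is only a minimal sketch of the general idea of a univariate EDA, not the authors' method. It substitutes a PBIL-style probability update (a different, well-known univariate technique) for the reinforcement learning step, and it uses the four peaks fitness in the form commonly attributed to Baluja and Caruana, which may differ from the variant used in the paper. All names and parameters (`four_peaks`, `univariate_eda`, `rate`, `n_select`, the clamp bounds) are assumptions made for illustration.

```python
import random


def four_peaks(bits, t):
    """Four peaks fitness (commonly cited form; an assumption here).

    Score = max(leading 1s, trailing 0s), plus a bonus of len(bits)
    when both counts exceed the threshold t.
    """
    n = len(bits)
    head = 0  # number of contiguous 1s at the start
    for b in bits:
        if b != 1:
            break
        head += 1
    tail = 0  # number of contiguous 0s at the end
    for b in reversed(bits):
        if b != 0:
            break
        tail += 1
    reward = n if head > t and tail > t else 0
    return max(head, tail) + reward


def univariate_eda(fitness, n_bits, pop_size=100, n_select=20,
                   rate=0.1, generations=200, seed=0):
    """Univariate EDA with a PBIL-style update (stand-in for RELEDA's rule)."""
    rng = random.Random(seed)
    probs = [0.5] * n_bits  # one independent probability per variable
    best, best_fit = None, float("-inf")
    for _ in range(generations):
        # Sample a population from the product of univariate marginals.
        pop = [[1 if rng.random() < p else 0 for p in probs]
               for _ in range(pop_size)]
        pop.sort(key=fitness, reverse=True)
        top_fit = fitness(pop[0])
        if top_fit > best_fit:
            best, best_fit = pop[0], top_fit
        # Move each marginal toward the frequency of 1s among the best samples.
        selected = pop[:n_select]
        for i in range(n_bits):
            freq = sum(ind[i] for ind in selected) / n_select
            probs[i] = (1 - rate) * probs[i] + rate * freq
            # Clamp so that no variable becomes permanently fixed.
            probs[i] = min(max(probs[i], 0.02), 0.98)
    return best, best_fit, probs
```

For example, `four_peaks([1, 1, 1, 0, 0, 0], t=2)` returns 9 (three leading ones, three trailing zeros, plus the bonus of 6), and `univariate_eda(lambda x: four_peaks(x, 2), n_bits=20)` runs the sketch on a 20-bit instance. Because the marginals are independent, a sketch like this can settle on one of the deceptive all-ones or all-zeros peaks; the abstract's claim is that RELEDA's reinforcement learning update copes with such correlated variables.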

Bibliographic record

Source: Genetic and Evolutionary Computation Conference, Pt. 2, Jul 12-16, 2003, Chicago, IL, USA | 2003 | pp. 1259-1270 | 12 pages
Conference location: San Francisco CA(US)
Authors: Topon Kumar Paul; Hitoshi Iba
Author affiliation: Graduate School of Frontier Sciences, The University of Tokyo, Hongo 7-3-1, Bunkyo-ku, Tokyo 113-8656, Japan
Original format: PDF
Text language: English
Chinese Library Classification: Genetics

Similar literature

1. A Novel Graph-Based Estimation of the Distribution Algorithm and its Extension Using Reinforcement Learning [J]. Li X., Mabu S., Hirasawa K. IEEE Transactions on Evolutionary Computation. 2014, No. 1
2. Reinforcement learning-based real time search algorithm for routing optimisation in wireless sensor networks using fuzzy link cost estimation [J]. Kuldeep Singh, Jyoteesh Malhotra. International Journal of Communication Networks and Distributed Systems. 2019, No. 4
3. Fuzzy Link Cost Estimation based Adaptive Tree Algorithm for Routing Optimization in Wireless Sensor Networks using Reinforcement Learning [J]. Kuldeep Singh, Jyoteesh Malhotra. International Journal of Sensors, Wireless Communication and Control. 2018, No. 3
4. A continuous estimation of distribution algorithm by evolving graph structures using reinforcement learning [C]. Li Xianneng, Li Bing, Mabu Shingo. Evolutionary Computation (CEC), 2012 IEEE Congress on. 2012
5. Using prior knowledge and learning from experience in estimation of distribution algorithms [D]. Hauschild, Mark. 2013
6. Myocardial infarction evaluation from stopping time decision toward interoperable algorithmic states in reinforcement learning [O]. Jong-Rul Park, Sung Phil Chung, Sung Yeon Hwang. 2020
7. Reinforcement Learning Estimation of Distribution Algorithm [O]. Topon Kumar Paul, Hitoshi Iba. 2003
