...
首页> 外文期刊>Applied Soft Computing >Continuous probabilistic model building genetic network programming using reinforcement learning
【24h】

Continuous probabilistic model building genetic network programming using reinforcement learning

机译:利用强化学习建立遗传网络程序的连续概率模型

获取原文
获取原文并翻译 | 示例
           

摘要

Recently, a novel probabilistic model-building evolutionary algorithm (so called estimation of distribution algorithm, or EDA), named probabilistic model building genetic network programming (PMBGNP), has been proposed. PMBGNP uses graph structures for its individual representation, which shows higher expression ability than the classical EDAs. Hence, it extends EDAs to solve a range of problems, suchas data mining and agent control. This paper is dedicated to propose a continuous version of PMBGNP for continuous optimization in agent control problems. Different from the other continuous EDAs, the proposed algorithm evolves the continuous variables by reinforcement learning (RL). We compare the performance with several state-of-the-art algorithms on a real mobile robot control problem. The results show that the proposed algorithm outperforms the others with statistically significant differences. (C) 2014 Elsevier B.V. All rights reserved.
机译:近来,已经提出了一种新的概率模型构建进化算法(所谓的分布估计算法,或EDA),称为概率模型构建遗传网络编程(PMBGNP)。 PMBGNP将图结构用于其单独表示,与传统的EDA相比,它具有更高的表达能力。因此,它扩展了EDA来解决一系列问题,例如数据挖掘和代理控制。本文致力于为代理控制问题的连续优化提出连续版本的PMBGNP。与其他连续EDA不同,该算法通过强化学习(RL)来演化连续变量。在实际的移动机器人控制问题上,我们将性能与几种最先进的算法进行了比较。结果表明,该算法优于其他算法,具有统计上的显着差异。 (C)2014 Elsevier B.V.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号