首页> 外文会议> >Evolutionary optimization of neural networks for reinforcement learning algorithms
【24h】

Evolutionary optimization of neural networks for reinforcement learning algorithms

机译:强化学习算法的神经网络进化优化

获取原文
获取原文并翻译 | 示例

摘要

In this paper we study the combination of two powerful approaches, evolutonary topology optimization (ENZO) and temporal dfiference learning (TD (#lambda#)) which is up to our knowledge the first time. Temporal difference learning was proven to be a well suited technique for learning strategies for solving reinforcement problems based on enrual network models, whereas evolutionary topology optimization is concurrently the most efficient network optimization technique. On two benchmarks, a labyrinth problem and the game Nine Men's Morris, the power of the approach is demonstrated. We concude that this combination of evolution and reinforcement learning algorithms is a suitable framework that uses the advantages of both methods leading to small and high performing networks for reinforcement problems.
机译:在本文中,我们研究了两种强大的方法的结合,即进化的拓扑结构优化(ENZO)和时间差异学习(TD(#lambda#)),这是我们第一次掌握的知识。时间差异学习被证明是一种非常适合的技术,用于基于enrual网络模型解决强化问题的学习策略,而进化拓扑优化同时也是最有效的网络优化技术。在两个基准上,一个迷宫问题和一个游戏《九个人的莫里斯》,证明了这种方法的强大功能。我们认为,进化和强化学习算法的这种组合是一个合适的框架,它利用两种方法的优点导致了强化问题的小型和高性能网络。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号