...
【24h】

A new asynchronous reinforcement learning algorithm based on improved parallel PSO

机译:一种基于改进的并行PSO的新的异步增强学习算法

获取原文
获取原文并翻译 | 示例

摘要

As an important machine learning method, reinforcement learning plays a more and more important role in practical application. In recent years, many scholars have studied parallel reinforcement learning algorithm, and achieved remarkable results in many applications. However, when using existing parallel reinforcement learning to solve problems, due to the limited search scope of agents, it often fails to reduce the running episodes of algorithms. At the same time, the traditional model-free reinforcement learning algorithm does not necessarily converge to the optimal solution, which may lead to some waste of resources in practical applications. In view of these problems, we apply Particle swarm optimization (PSO) algorithm to asynchronous reinforcement learning algorithm to search for the optimal solution. First, we propose a new asynchronous variant of PSO algorithm. Then we apply it into asynchronous reinforcement learning algorithm, and proposed a new asynchronous reinforcement learning algorithm named Sarsa algorithm based on backward Q-learning and asynchronous particle swarm optimization (APSO-BQSA). Finally, we verify the effectiveness of the asynchronous PSO and APSO-BQSA algorithm proposed in this paper through experiments.
机译:作为一个重要的机器学习方法,加强学习在实际应用中起着越来越重要的作用。近年来,许多学者研究了并行加强学习算法,并在许多应用中实现了显着的结果。然而,在使用现有的并行增强学习来解决问题时,由于代理的搜索范围有限,它通常无法减少算法的运行剧集。同时,传统的无模型加强学习算法不一定会收敛到最佳解决方案,这可能导致在实际应用中的一些资源浪费。鉴于这些问题,我们将粒子群优化(PSO)算法应用于异步增强学习算法,以搜索最佳解决方案。首先,我们提出了一种新的PSO算法的异步变体。然后,我们将其应用于异步强化学习算法,并提出了一种基于后Q学习和异步粒子群优化(APSO-BQSA)的SASARSA算法的新的异步增强学习算法。最后,我们通过实验验证了本文中提出的异步PSO和APSO-BQSA算法的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号