首页> 外文会议>Simulated evolution and learning >Swarm Reinforcement Learning Method Based on an Actor-Critic Method
【24h】

Swarm Reinforcement Learning Method Based on an Actor-Critic Method

机译:基于Actor-Critic方法的群体强化学习方法

获取原文
获取原文并翻译 | 示例

摘要

We recently proposed swarm reinforcement learning methods in which multiple agents are prepared and they learn not only by individual learning but also by learning through exchanging information among the agents. The methods have been applied to a problem in discrete state-action space so far, and Q-learning method has been used as the individual learning. Although many studies in reinforcement learning have been done for problems in the discrete state-action space, continuous state-action space is required for coping with most real-world tasks. This paper proposes a swarm reinforcement learning method based on an actor-critic method in order to acquire optimal policies rapidly for problems in the continuous state-action space. The proposed method is applied to an inverted pendulum control problem, and its performance is examined through numerical experiments.
机译:我们最近提出了一种群体强化学习方法,其中准备了多个代理,它们不仅通过个体学习来学习,而且通过在代理之间交换信息来学习。到目前为止,该方法已应用于离散状态作用空间中的问题,并且Q学习方法已用作个体学习。尽管已经针对离散状态动作空间中的问题进行了强化学习方面的许多研究,但应对大多数实际任务仍需要连续的状态动作空间。提出了一种基于行为者批判的群体强化学习方法,以快速获取连续状态-作用空间问题的最优策略。将该方法应用于倒立摆控制问题,并通过数值实验验证了其性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号