首页> 美国卫生研究院文献>Frontiers in Neuroscience >Reinforcement Learning With Low-Complexity Liquid State Machines
【2h】

Reinforcement Learning With Low-Complexity Liquid State Machines

机译:低复杂度液体状态机的强化学习

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

We propose reinforcement learning on simple networks consisting of random connections of spiking neurons (both recurrent and feed-forward) that can learn complex tasks with very little trainable parameters. Such sparse and randomly interconnected recurrent spiking networks exhibit highly non-linear dynamics that transform the inputs into rich high-dimensional representations based on the current and past context. The random input representations can be efficiently interpreted by an output (or readout) layer with trainable parameters. Systematic initialization of the random connections and training of the readout layer using Q-learning algorithm enable such small random spiking networks to learn optimally and achieve the same learning efficiency as humans on complex reinforcement learning (RL) tasks like Atari games. In fact, the sparse recurrent connections cause these networks to retain fading memory of past inputs, thereby enabling them to perform temporal integration across successive RL time-steps and learn with partial state inputs. The spike-based approach using small random recurrent networks provides a computationally efficient alternative to state-of-the-art deep reinforcement learning networks with several layers of trainable parameters.
机译:我们建议在由尖峰神经元(递归和前馈)的随机连接组成的简单网络上进行强化学习,这种学习可以通过很少的可训练参数来学习复杂的任务。这种稀疏且随机互连的递归尖峰网络表现出高度的非线性动态,可以根据当前和过去的上下文将输入转换为丰富的高维表示。具有可训练参数的输出(或读出)层可以有效地解释随机输入表示。使用Q学习算法对随机连接进行系统初始化和对读出层进行训练,使此类小型随机尖峰网络能够在Atari游戏等复杂强化学习(RL)任务上实现最佳学习,并获得与人类相同的学习效率。实际上,稀疏的循环连接使这些网络保留了过去输入的衰落记忆,从而使它们能够跨连续的RL时间步执行时间积分,并利用部分状态输入进行学习。使用小型随机递归网络的基于尖峰的方法为具有多层可训练参数的最新深度强化学习网络提供了一种计算有效的替代方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号