首页> 外文会议>International conference on artificial neural networks >Continuous-Time Spike-Based Reinforcement Learning for Working Memory Tasks
【24h】

Continuous-Time Spike-Based Reinforcement Learning for Working Memory Tasks

机译:基于连续时间峰值的强化学习,用于工作记忆任务

获取原文

摘要

As the brain purportedly employs on-policy reinforcement learning compatible with SARSA learning, and most interesting cognitive tasks require some form of memory while taking place in continuous-time, recent work has developed plausible reinforcement learning schemes that are compatible with these requirements. Lacking is a formulation of both computation and learning in terms of spiking neurons. Such a formulation creates both a closer mapping to biology, and also expresses such learning in terms of asynchronous and sparse neural computation. We present a spiking neural network with memory that learns cognitive tasks in continuous time. Learning is biologically plausibly implemented using the AuGMeNT framework, and we show how separate spiking forward and feedback networks suffice for learning the tasks just as fast the analog CT-AuGMeNT counterpart, while computing efficiently using very few spikes: 1-20 Hz on average.
机译:据说大脑采用与SARSA学习兼容的基于策略的强化学习,并且大多数有趣的认知任务在连续时间内进行时都需要某种形式的记忆,因此最近的工作已经开发出了符合这些要求的合理的强化学习方案。缺乏是关于尖峰神经元的计算和学习的表述。这样的表述不仅创造了对生物学的更紧密的映射,而且还通过异步和稀疏的神经计算表达了这种学习。我们提出了一个具有记忆力的尖峰神经网络,可以连续不断地学习认知任务。使用AuGMeNT框架在生物学上似乎可以实现学习,并且我们展示了独立的前馈和反馈网络足以满足学习模拟CT-AuGMeNT对应对象的速度,同时使用很少的尖峰即可高效地进行计算:平均1-20 Hz。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号