Continuous-Time Spike-Based Reinforcement Learning for Working Memory Tasks

Abstract

As the brain purportedly employs on-policy reinforcement learning compatible with SARSA learning, and most interesting cognitive tasks require some form of memory while taking place in continuous time, recent work has developed plausible reinforcement learning schemes that are compatible with these requirements. What is lacking is a formulation of both computation and learning in terms of spiking neurons. Such a formulation both creates a closer mapping to biology and expresses such learning in terms of asynchronous and sparse neural computation. We present a spiking neural network with memory that learns cognitive tasks in continuous time. Learning is implemented in a biologically plausible way using the AuGMeNT framework, and we show how separate spiking forward and feedback networks suffice for learning the tasks just as fast as the analog CT-AuGMeNT counterpart, while computing efficiently using very few spikes: 1-20 Hz on average.

Bibliographic details

Source
International Conference on Artificial Neural Networks | 2018 | pp. 250-262 | 13 pages
Authors
Marios Karamanis; Davide Zambrano; Sander Bohte
Original format: PDF
Keywords
Reinforcement learning; Working memory; Spiking neurons

Similar literature

1. Adaptive coordination of working-memory and reinforcement learning in non-human primates performing a trial-and-error problem solving task [J]. Viejo Guillaume, Girard Benoit, Procyk Emmanuel. Behavioural Brain Research: An International Journal. 2018
2. How much of reinforcement learning is working memory, not reinforcement learning? A behavioral, computational, and neurogenetic analysis [J]. Collins A.G.E., Frank M.J. The European Journal of Neuroscience. 2012, no. 7a8
3. Modeling choice and reaction time during arbitrary visuomotor learning through the coordination of adaptive working memory and reinforcement learning [J]. Guillaume Viejo, Mehdi Khamassi, Andrea Brovelli. Frontiers in Behavioral Neuroscience. 2015, no. 2
4. Continuous-Time Spike-Based Reinforcement Learning for Working Memory Tasks [C]. Marios Karamanis, Davide Zambrano, Sander Bohte. International Conference on Artificial Neural Networks. 2018
5. Concurrent verbalization, task complexity, and working memory: Effects on L2 learning in a computerized task [D]. Medina, Almitra Dadin. 2008
6. How much of reinforcement learning is working memory, not reinforcement learning? A behavioral, computational, and neurogenetic analysis [O]. Anne G. E. Collins, Michael J. Frank
7. Continuous-time on-policy neural reinforcement learning of working memory tasks [O]. Zambrano, Davide, Roelfsema, P.R., Bohte, Sander. 2015
8. Extending Hierarchical Reinforcement Learning to Continuous-Time, Average-Reward, and Multi-Agent Models [R]. Ghavamzadeh, M., Mahadevan, S., Makar, R. 2003
