首页> 外文会议>International conference on cognitive neurodynamics >Combining Supervised, Unsupervised, and Reinforcement Learning in a Network of Spiking Neurons
【24h】

Combining Supervised, Unsupervised, and Reinforcement Learning in a Network of Spiking Neurons

机译:在尖峰网络网络中结合监督,无监督和加强学习

获取原文

摘要

The human brain constantly learns via mutiple different learning strategies. It can learn by simply having stimuli being presented to its sensory org'ans which is considered unsupervised learning. In addition, it can learn associations between inputs and outputs when a teacher provides the output which is considered as supervised learning. Most importantly, it can learn very efficiently if correct behaviour is followed by reward and/or incorrect behaviour is followed by punishment which is considered reinforcement learning. So far, most artificial neural architectures implement only one of the three learning mechanisms — even though the brain integrates all three. Here, we have implemented unsupervised, supervised, and reinforcement learning within a network of spiking neurons. In order to achieve this ambitious goal, the existing learning rule called spike-timing-dependent plasticity had to be extended such that it is modulated by the reward signal dopamine.
机译:人类大脑经常通过多次不同的学习策略学习。它可以通过简单地向其感官的org'ans呈现刺激而学习,这被认为是无监督的学习。此外,当教师提供被视为监督学习的输出时,它可以学习输入和输出之间的关联。最重要的是,如果正确的行为随后是奖励和/或不正确的行为之后,它可以非常有效地学习,然后被视为被认为是强化学习的惩罚。到目前为止,大多数人工神经结构只实现了三种学习机制中的一个 - 尽管大脑整合了这三个。在这里,我们在尖峰神经元网络中实施了无监督,监督和强化学习。为了实现这种雄心勃勃的目标,必须延长所谓的现有学习规则,使得它由奖励信号多巴胺调节。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号