首页> 外文期刊>Scientific reports. >The serial blocking effect: a testbed for the neural mechanisms of temporal-difference learning
【24h】

The serial blocking effect: a testbed for the neural mechanisms of temporal-difference learning

机译:串行阻断效应:时差学习神经机制的试验平台

获取原文
       

摘要

Temporal-difference (TD) learning models afford the neuroscientist a theory-driven roadmap in the quest for the neural mechanisms of reinforcement learning. The application of these models to understanding the role of phasic midbrain dopaminergic responses in reward prediction learning constitutes one of the greatest success stories in behavioural and cognitive neuroscience. Critically, the classic learning paradigms associated with TD are poorly suited to cast light on its neural implementation, thus hampering progress. Here, we present a serial blocking paradigm in rodents that overcomes these limitations and allows for the simultaneous investigation of two cardinal TD tenets; namely, that learning depends on the computation of a prediction error, and that reinforcing value, whether intrinsic or acquired, propagates back to the onset of the earliest reliable predictor. The implications of this paradigm for the neural exploration of TD mechanisms are highlighted.
机译:时差(TD)学习模型为神经科学家寻求强化学习的神经机制提供了理论驱动的路线图。这些模型在理解阶段性中脑多巴胺能反应在奖励预测学习中的作用的应用,构成了行为和认知神经科学领域最大的成功案例之一。至关重要的是,与TD相关的经典学习范式不太适合对它的神经实现方法投以光明,从而阻碍了进步。在这里,我们提出了一种在啮齿动物中的串行阻断范例,它克服了这些限制并允许同时研究两个主要的TD原理;就是说,学习取决于预测误差的计算,强化值,无论是固有的还是获得的,都将传播回最早的可靠预测器。强调了这种范例对TD机制的神经探索的意义。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号