Journal: Neural Networks: The Official Journal of the International Neural Network Society

Hippocampal replay contributes to within session learning in a temporal difference reinforcement learning model.

Abstract

Temporal difference reinforcement learning (TDRL) algorithms, hypothesized to partially explain basal ganglia functionality, learn more slowly than real animals. Modified TDRL algorithms (e.g. the Dyna-Q family) learn faster than standard TDRL by practicing experienced sequences offline. We suggest that the replay phenomenon, in which ensembles of hippocampal neurons replay previously experienced firing sequences during subsequent rest and sleep, may provide practice sequences to improve the speed of TDRL learning, even within a single session. We test the plausibility of this hypothesis in a computational model of a multiple-T choice-task. Rats show two learning rates on this task: a fast decrease in errors and a slow development of a stereotyped path. Adding developing replay to the model accelerates learning the correct path, but slows down the stereotyping of that path. These models provide testable predictions about the effects of hippocampal inactivation and of hippocampal replay on this task.
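
The abstract refers to Dyna-Q-style algorithms that speed up temporal-difference learning by replaying stored experience offline. The sketch below illustrates that general idea with tabular Q-learning on a toy linear track; it is a minimal illustration under assumed parameters (ALPHA, GAMMA, EPSILON, n_replay) and a made-up environment, not the authors' multiple-T task model.

```python
import random
from collections import defaultdict

# Minimal sketch of tabular Q-learning with Dyna-Q-style replay: transitions
# experienced online are stored and later replayed offline, which propagates
# value faster than online TD updates alone. Toy environment and parameters
# are illustrative assumptions, not the model described in the paper.

ALPHA, GAMMA, EPSILON = 0.1, 0.95, 0.1
ACTIONS = (-1, +1)          # step left / step right
START, GOAL = 0, 10         # reward delivered on reaching GOAL

def step(state, action):
    """Toy environment: move along a line; reward 1 at the goal state."""
    nxt = max(0, min(GOAL, state + action))
    return nxt, (1.0 if nxt == GOAL else 0.0), nxt == GOAL

def greedy(Q, s):
    """Greedy action with random tie-breaking."""
    best = max(Q[(s, a)] for a in ACTIONS)
    return random.choice([a for a in ACTIONS if Q[(s, a)] == best])

def td_update(Q, s, a, r, s_next):
    """One temporal-difference (Q-learning) backup."""
    target = r + GAMMA * max(Q[(s_next, a2)] for a2 in ACTIONS)
    Q[(s, a)] += ALPHA * (target - Q[(s, a)])

def run_episode(Q, memory, n_replay=20):
    """Act online, store each transition, then replay stored ones offline."""
    s, done = START, False
    while not done:
        a = random.choice(ACTIONS) if random.random() < EPSILON else greedy(Q, s)
        s_next, r, done = step(s, a)
        td_update(Q, s, a, r, s_next)
        memory.append((s, a, r, s_next))       # remember the experience
        s = s_next
    # Offline "replay": extra backups on remembered transitions, the source
    # of the Dyna-Q speed-up relative to standard online TDRL.
    for _ in range(n_replay):
        td_update(Q, *random.choice(memory))

if __name__ == "__main__":
    Q, memory = defaultdict(float), []
    for _ in range(30):
        run_episode(Q, memory)
    print({s: round(max(Q[(s, a)] for a in ACTIONS), 3) for s in range(GOAL + 1)})
```

With replay (n_replay > 0), learned values spread back from the rewarded state within far fewer episodes than with online updates alone, which is the kind of within-session speed-up the abstract attributes to hippocampal replay.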