首页> 美国卫生研究院文献>The Journal of Neuroscience >Ventral Striatum and Orbitofrontal Cortex Are Both Required for Model-Based But Not Model-Free Reinforcement Learning

【2h】

Ventral Striatum and Orbitofrontal Cortex Are Both Required for Model-Based But Not Model-Free Reinforcement Learning

机译：基于模型而不是无模型的强化学习都需要腹侧纹状体和眶额皮质

代理获取

本网站仅为用户提供外文OA文献查询和代理获取服务，本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文，但由于OA文献来源多样且变更频繁，仍可能出现获取不到、文献不完整或与标题不符等情况，如果获取不到我们将提供退款服务。请知悉。

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

In many cases, learning is thought to be driven by differences between the value of rewards we expect and rewards we actually receive. Yet learning can also occur when the identity of the reward we receive is not as expected, even if its value remains unchanged. Learning from changes in reward identity implies access to an internal model of the environment, from which information about the identity of the expected reward can be derived. As a result, such learning is not easily accounted for by model-free reinforcement learning theories such as temporal difference reinforcement learning (TDRL), which predicate learning on changes in reward value, but not identity. Here, we used unblocking procedures to assess learning driven by value- versus identity-based prediction errors. Rats were trained to associate distinct visual cues with different food quantities and identities. These cues were subsequently presented in compound with novel auditory cues and the reward quantity or identity was selectively changed. Unblocking was assessed by presenting the auditory cues alone in a probe test. Consistent with neural implementations of TDRL models, we found that the ventral striatum was necessary for learning in response to changes in reward value. However, this area, along with orbitofrontal cortex, was also required for learning driven by changes in reward identity. This observation requires that existing models of TDRL in the ventral striatum be modified to include information about the specific features of expected outcomes derived from model-based representations, and that the role of orbitofrontal cortex in these models be clearly delineated.

机译：在许多情况下，人们认为学习是由我们期望的奖励价值与我们实际获得的奖励之间的差异驱动的。然而，当我们获得的奖励的身份与预期不符时，即使其价值保持不变，也会发生学习。从奖励身份的变化中学习意味着可以访问环境的内部模型，从中可以得出有关预期奖励的身份的信息。结果，这种学习不容易被无模型的强化学习理论（如时差强化学习（TDRL））所解释，该理论基于学习奖励价值的变化而不是身份。在这里，我们使用了无障碍程序来评估由基于价值和基于身份的预测错误驱动的学习。对大鼠进行了训练，使不同的视觉提示与不同的食物数量和身份相关联。这些提示随后与新颖的听觉提示一起呈现，并且奖励数量或身份被有选择地改变。通过在探查测试中单独呈现听觉提示来评估疏通程度。与TDRL模型的神经实现相一致，我们发现腹侧纹状体对于响应奖励价值变化的学习是必要的。但是，该区域以及眶额皮质也是奖励身份变化驱动的学习所必需的。该观察要求对腹侧纹状体中的TDRL现有模型进行修改，以包括有关从基于模型的表示中得出的预期结果的特定特征的信息，并明确描述眶额皮质在这些模型中的作用。

著录项

期刊名称 The Journal of Neuroscience
作者
Michael A. McDannald; Federica Lucantonio; Kathryn A. Burke; Yael Niv; Geoffrey Schoenbaum;
展开▼
作者单位

展开▼
年(卷),期 2011(31),7
年度 2011
页码 2700–2705
总页数 6
原文格式 PDF
正文语种
中图分类神经科学;
关键词

相似文献

外文文献
中文文献
专利

1. Transition from 'model-based' to 'model-free' behavioral control in addiction: Involvement of the orbitofrontal cortex and dorsolateral striatum [J] . LucantonioF., CaprioliD., SchoenbaumG. Neuropharmacology . 2014,第Pta2期

机译：从成瘾的“基于模型”的行为控制过渡到“无模型”的行为控制：眶额皮质和背外侧纹状体的参与
2. Model-based learning and the contribution of the orbitofrontal cortex to the model-free world [J] . McdannaldM.A., TakahashiY.K., LopatinaN., The European Journal of Neuroscience . 2012,第7a8期

机译：基于模型的学习以及眶额皮质对无模型世界的贡献
3. Operant learning requires NMDA-receptor activation in the anterior cingulate cortex and dorsomedial striatum, but not in the orbitofrontal cortex [J] . McKee B.L., Kelley A.E., Moser H.R., Behavioral neuroscience . 2010,第4期

机译：操作学习需要在前扣带回皮层和背侧纹状体中激活NMDA受体，但在眶额皮层中则不需要
4. Amygdala and ventral striatum population codes implement multiple learning rates for reinforcement learning [C] . Bruno B. Averbeck IEEE Symposium Series on Computational Intelligence . 2017

机译：杏仁核和腹侧纹状体人口代码实施多种学习率以进行强化学习
5. Neural computations underlying value learning in the ventral tegmental area and orbitofrontal cortex of rhesus macaques. [D] . Grattan, Lauren Elizabeth. 2014

机译：恒河猴猕猴腹侧被盖区和眶额皮质的价值学习基础的神经计算。
6. Transition from ‘model-based’ to ‘model-free’ behavioral control in addiction: involvement of the orbitofrontal cortex and dorsolateral striatum [O] . Federica Lucantonio, Daniele Caprioli, Geoffrey Schoenbaum -1

机译：成瘾从基于模型的行为控制过渡到无模型的行为控制：眶额皮质和背外侧纹状体的参与
7. Ventral Striatum and Orbitofrontal Cortex Are Both Required for Model-Based, But Not Model-Free, Reinforcement Learning [O] . Michael A. Mcdannald, Federica Lucantonio, Kathryn A. Burke, 2012

机译：基于模型而不是无模型的强化学习都需要腹侧纹状体和眶额皮质

Ventral Striatum and Orbitofrontal Cortex Are Both Required for Model-Based But Not Model-Free Reinforcement Learning

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅