Variability in Dopamine Genes Dissociates Model-Based and Model-Free Reinforcement Learning

机译：多巴胺基因的变异性会分离基于模型的和无模型的强化学习

代理获取

本网站仅为用户提供外文OA文献查询和代理获取服务，本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文，但由于OA文献来源多样且变更频繁，仍可能出现获取不到、文献不完整或与标题不符等情况，如果获取不到我们将提供退款服务。请知悉。

页面导航

摘要
著录项
相似文献
相关主题

摘要

Considerable evidence suggests that multiple learning systems can drive behavior. Choice can proceed reflexively from previous actions and their associated outcomes, as captured by “model-free” learning algorithms, or flexibly from prospective consideration of outcomes that might occur, as captured by “model-based” learning algorithms. However, differential contributions of dopamine to these systems are poorly understood. Dopamine is widely thought to support model-free learning by modulating plasticity in striatum. Model-based learning may also be affected by these striatal effects, or by other dopaminergic effects elsewhere, notably on prefrontal working memory function. Indeed, prominent demonstrations linking striatal dopamine to putatively model-free learning did not rule out model-based effects, whereas other studies have reported dopaminergic modulation of verifiably model-based learning, but without distinguishing a prefrontal versus striatal locus. To clarify the relationships between dopamine, neural systems, and learning strategies, we combine a genetic association approach in humans with two well-studied reinforcement learning tasks: one isolating model-based from model-free behavior and the other sensitive to key aspects of striatal plasticity. Prefrontal function was indexed by a polymorphism in the COMT gene, differences of which reflect dopamine levels in the prefrontal cortex. This polymorphism has been associated with differences in prefrontal activity and working memory. Striatal function was indexed by a gene coding for DARPP-32, which is densely expressed in the striatum where it is necessary for synaptic plasticity. We found evidence for our hypothesis that variations in prefrontal dopamine relate to model-based learning, whereas variations in striatal dopamine function relate to model-free learning.>SIGNIFICANCE STATEMENT Decisions can stem reflexively from their previously associated outcomes or flexibly from deliberative consideration of potential choice outcomes. Research implicates a dopamine-dependent striatal learning mechanism in the former type of choice. Although recent work has indicated that dopamine is also involved in flexible, goal-directed decision-making, it remains unclear whether it also contributes via striatum or via the dopamine-dependent working memory function of prefrontal cortex. We examined genetic indices of dopamine function in these regions and their relation to the two choice strategies. We found that striatal dopamine function related most clearly to the reflexive strategy, as previously shown, and that prefrontal dopamine related most clearly to the flexible strategy. These findings suggest that dissociable brain regions support dissociable choice strategies.

机译：大量证据表明，多种学习系统可以推动行为。选择可以根据“无模型”学习算法捕获的先前动作及其相关结果进行反思，或者根据“基于模型”学习算法捕获的可能发生的结果的前瞻性思考进行灵活选择。但是，人们对多巴胺对这些系统的不同贡献知之甚少。多巴胺被广泛认为可通过调节纹状体的可塑性来支持无模型学习。基于模型的学习也可能受到这些纹状体效应或其他地方的其他多巴胺能效应的影响，特别是对前额叶工作记忆功能的影响。确实，将纹状体多巴胺与假定的无模型学习联系起来的显着论据并未排除基于模型的影响，而其他研究则报道了可验证的基于模型的学习的多巴胺能调节，但没有区分前额叶和纹状体。为了阐明多巴胺，神经系统和学习策略之间的关系，我们将人类的遗传关联方法与两项经过充分研究的强化学习任务相结合：一种将基于模型的行为与无模型行为隔离开来，另一种对纹状体的关键方面敏感可塑性。前额叶功能是通过COMT基因的多态性来索引的，其差异反映了前额叶皮层中的多巴胺水平。这种多态性与前额叶活动和工作记忆的差异有关。纹状体功能由编码DARPP-32的基因索引，该基因在纹状体中密集表达，而突触可塑性是必需的。我们发现了以下假设的证据：前额叶多巴胺的变化与基于模型的学习有关，而纹状体多巴胺功能的变化与无模型的学习有关。>意义声明，决策可以反过来源于其先前相关的结果或灵活地考虑潜在选择结果。研究表明在前一种选择中多巴胺依赖性纹状体学习机制。尽管最近的工作表明多巴胺也参与了灵活的，目标导向的决策，但仍不清楚它是否也通过纹状体或前额叶皮层的多巴胺依赖性工作记忆功能来发挥作用。我们检查了这些地区多巴胺功能的遗传指标及其与两种选择策略的关系。如前所述，我们发现纹状体多巴胺功能与反射策略最明显相关，前额叶多巴胺与弹性策略最明显相关。这些发现表明，可分离的大脑区域支持可分离的选择策略。

著录项

期刊名称 The Journal of Neuroscience
作者
Bradley B. Doll; Kevin G. Bath; Nathaniel D. Daw; Michael J. Frank;
展开▼
作者单位

展开▼
年(卷),期 2016(36),4
年度 2016
页码 1211–1222
总页数 12
原文格式 PDF
正文语种
中图分类神经科学;
关键词
decision-making dopamine genetics reinforcement learning;

机译：决策;多巴胺;遗传学;强化学习;

相似文献

外文文献
中文文献
专利

1. Variability in Dopamine Genes Dissociates Model-Based and Model-Free Reinforcement Learning [J] . Doll Bradley B., Bath Kevin G., Daw Nathaniel D., The Journal of Neuroscience: The Official Journal of the Society for Neuroscience . 2016,第4期

机译：多巴胺基因的变异性会分离基于模型的和无模型的强化学习
2. States versus rewards: dissociable neural prediction error signals underlying model-based and model-free reinforcement learning. [J] . Glascher J, Daw N, Dayan P, Neuron . 2010,第4期

机译：状态与回报：基于模型和无模型的强化学习背后的可分离的神经预测错误信号。
3. The "Proactive" Model of Learning: Integrative Framework for Model-Free and Model-Based Reinforcement Learning Utilizing the Associative Learning-Based Proactive Brain Concept [J] . Zsuga Judit, Biro Klara, Papp Csaba, Behavioral neuroscience . 2016,第1期

机译：“主动”学习模型：利用基于联合学习的主动脑概念进行无模型和基于模型的强化学习的集成框架
4. EEG-based classification of learning strategies : Model-based and model-free reinforcement learning [C] . Dongjae Kim, Charles Weston, Sang Wan Lee 2018 6th International Conference on Brain-Computer Interface . 2018

机译：基于脑电图的学习策略分类：基于模型和无模型的强化学习
5. Model-based Bayesian Reinforcement Learning with Generalized Priors [D] . Asmuth, John Thomas 2013

机译：具有广义先验的基于模型的贝叶斯强化学习
6. States versus Rewards: Dissociable neural prediction error signals underlying model-based and model-free reinforcement learning [O] . Jan Gläscher, Nathaniel Daw, Peter Dayan, -1

机译：各种与奖励：可解离的神经预测误差信号底层模型和无模型加强学习
7. States versus Rewards: Dissociable Neural Prediction Error Signals Underlying Model-Based and Model-Free Reinforcement Learning [O] . Gläscher Jan, Daw Nathaniel, Dayan Peter, 2010

机译：状态与奖励：基于模型和免费模型的强化学习背后的可分离神经预测误差信号

Variability in Dopamine Genes Dissociates Model-Based and Model-Free Reinforcement Learning

摘要

著录项

相似文献

相关主题

期刊订阅