Psychological Review

Reconciling Reinforcement Learning Models With Behavioral Extinction and Renewal: Implications for Addiction, Relapse, and Problem Gambling

Abstract

Because learned associations are quickly renewed following extinction, the extinction process must include processes other than unlearning. However, reinforcement learning models, such as the temporal difference reinforcement learning (TDRL) model, treat extinction as an unlearning of associated value and are thus unable to capture renewal. TDRL models are based on the hypothesis that dopamine carries a reward prediction error signal; these models predict reward by driving that reward error to zero. The authors construct a TDRL model that can accommodate extinction and renewal through two simple processes: (a) a TDRL process that learns the value of situation-action pairs and (b) a situation recognition process that categorizes the observed cues into situations. This model has implications for dysfunctional states, including relapse after addiction and problem gambling.
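The two-process account in the abstract can be illustrated with a minimal, hypothetical Python sketch. This is not the authors' published implementation: the class name SituationTDRL, the cosine-similarity matching rule, and the parameters alpha, gamma, and match_threshold are assumptions made here for illustration. Process (a) is the standard temporal-difference update that drives the reward prediction error toward zero; process (b) categorizes observed cues into situations, so that extinction is learned within a newly created situation rather than by erasing the old value, leaving the original association available for renewal.

```python
import numpy as np

class SituationTDRL:
    """Sketch of (a) TDRL over situation-action values and (b) situation recognition."""

    def __init__(self, n_actions, alpha=0.1, gamma=0.9, match_threshold=0.8):
        self.n_actions = n_actions
        self.alpha = alpha                       # learning rate (assumed value)
        self.gamma = gamma                       # temporal discount factor (assumed value)
        self.match_threshold = match_threshold   # cue-similarity criterion (assumed)
        self.situations = []                     # prototype cue vector for each known situation
        self.Q = []                              # situation-action values, one row per situation

    def categorize(self, cues):
        """Situation recognition: map observed cues onto a known situation or create a new one."""
        cues = np.asarray(cues, dtype=float)
        for i, proto in enumerate(self.situations):
            # cosine similarity stands in for whatever categorization rule is used
            sim = cues @ proto / (np.linalg.norm(cues) * np.linalg.norm(proto) + 1e-12)
            if sim >= self.match_threshold:
                return i
        self.situations.append(cues)
        self.Q.append(np.zeros(self.n_actions))  # a new situation starts with no learned value
        return len(self.situations) - 1

    def update(self, s, a, reward, s_next):
        """TDRL update: drive the reward prediction error (delta) toward zero."""
        delta = reward + self.gamma * np.max(self.Q[s_next]) - self.Q[s][a]
        self.Q[s][a] += self.alpha * delta
        return delta  # the quantity the abstract identifies with the dopamine signal


# Usage sketch: cues observed during acquisition and during extinction are
# categorized as different situations, so the value learned during acquisition
# is preserved rather than unlearned, which supports rapid renewal.
agent = SituationTDRL(n_actions=2)
s_acq = agent.categorize([1.0, 0.0, 0.0])   # cue configuration during acquisition
s_ext = agent.categorize([0.0, 1.0, 0.0])   # mismatching configuration during extinction
assert s_acq != s_ext                       # extinction values live in a separate situation
```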