A comparison model of reinforcement-learning and win-stay-lose-shift decision-making processes: A tribute to W.K. Estes

Darrell A. Worthy; W. Todd Maddox


Abstract

W.K. Estes often championed an approach to model development whereby an existing model was augmented by the addition of one or more free parameters to account for additional psychological mechanisms. Following this same approach, we utilized Estes' (1950) own augmented learning equations to improve the plausibility of a win-stay-lose-shift (WSLS) model that we have used in much of our recent work. We also improved the plausibility of a basic reinforcement-learning (RL) model by augmenting its assumptions. Estes also championed models that assumed a comparison between multiple concurrent cognitive processes. In line with this, we develop a WSLS-RL model that assumes that people have tendencies to stay with the same option or switch to a different option following trials with relatively good ("win") or bad ("lose") outcomes, and that the tendencies to stay or shift are adjusted based on the relative expected value of each option. Comparisons of simulations of the WSLS-RL model with data from three different decision-making experiments suggest that the WSLS-RL model provides a good account of decision-making behavior. Our results also support the assertion that human participants weigh both the overall valence of the previous trial's outcome and the relative value of each option during decision-making.
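The idea of stay/shift tendencies that are adjusted by relative expected value can be illustrated with a minimal sketch. This is not the authors' model or their equations, which the abstract does not give. It assumes a two-option task with binary rewards, treats a reward of 1 as a "win", learns expected values with a simple delta rule, and shifts the WSLS stay probability in log-odds by the value gap between the options. All names and parameters here (`stay_probability`, `simulate`, `alpha`, `kappa`, `p_stay_win`, `p_shift_lose`) are hypothetical.

```python
import math
import random


def logit(p):
    """Log-odds of a probability strictly between 0 and 1."""
    return math.log(p / (1.0 - p))


def stay_probability(prev_choice, prev_win, q, p_stay_win, p_shift_lose, kappa):
    """Probability of repeating the previous choice (illustrative assumption).

    The WSLS part supplies a baseline stay tendency that depends on whether
    the last outcome was a win or a loss. The RL part shifts that baseline in
    log-odds by kappa times the expected-value gap favoring the previous option.
    """
    base = p_stay_win if prev_win else 1.0 - p_shift_lose
    value_gap = q[prev_choice] - q[1 - prev_choice]
    return 1.0 / (1.0 + math.exp(-(logit(base) + kappa * value_gap)))


def simulate(n_trials, reward_probs, alpha=0.2, p_stay_win=0.8,
             p_shift_lose=0.6, kappa=2.0, seed=0):
    """Simulate choices on a two-option task with Bernoulli rewards."""
    rng = random.Random(seed)
    q = [0.5, 0.5]              # expected values, assumed to start at 0.5
    choice = rng.randrange(2)   # first choice is random
    choices = []
    for _ in range(n_trials):
        reward = 1.0 if rng.random() < reward_probs[choice] else 0.0
        q[choice] += alpha * (reward - q[choice])   # delta-rule update
        choices.append(choice)
        win = reward > 0.0      # assumption: any reward counts as a "win"
        p_stay = stay_probability(choice, win, q, p_stay_win,
                                  p_shift_lose, kappa)
        if rng.random() >= p_stay:
            choice = 1 - choice  # shift to the other option
    return choices, q


if __name__ == "__main__":
    choices, q = simulate(200, [0.8, 0.2])
    print("proportion choosing option 0:", choices.count(0) / len(choices))
    print("learned expected values:", q)
```

Setting `kappa` to 0 reduces the sketch to a pure WSLS rule with fixed stay and shift probabilities, while larger values of `kappa` let the learned expected values dominate the choice. That gives one simple way to see how the two processes could be compared within a single model.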


Bibliographic record

Source: Journal of Mathematical Psychology, 2014, 9 pages
Authors: Darrell A. Worthy; W. Todd Maddox
Original format: PDF
Language: English
Chinese Library Classification: Mathematical psychology; psychological statistics
Keywords: Decision-making; Dual-process; Mathematical modeling; Win-stay-lose-shift; Reinforcement learning

