首页> 外文期刊>Journal of Mathematical Psychology >A comparison model of reinforcement-learning and win-stay-lose-shift decision-making processes: A tribute to W.K. Estes
【24h】

A comparison model of reinforcement-learning and win-stay-lose-shift decision-making processes: A tribute to W.K. Estes

机译:强化学习和胜利-失败-失败转变决策过程的比较模型:向W.K.致敬埃斯蒂斯

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

W.K. Estes often championed an approach to model development whereby an existing model was augmented by the addition of one or more free parameters to account for additional psychological mechanisms. Following this same approach we utilized Estes' (1950) own augmented learning equations to improve the plausibility of a win-stay-lose-shift (WSLS) model that we have used in much of our recent work. We also improved the plausibility of a basic reinforcement-learning (RL) model by augmenting its assumptions. Estes also championed models that assumed a comparison between multiple concurrent cognitive processes. In line with this, we develop a WSLS-RL model that assumes that people have tendencies to stay with the same option or switch to a different option following trials with relatively good (''win'') or bad (''lose'') outcomes, and that the tendencies to stay or shift are adjusted based on the relative expected value of each option. Comparisons of simulations of the WSLS-RL model with data from three different decision-making experiments suggest that the WSLS-RL provides a good account of decisionmaking behavior. Our results also support the assertion that human participants weigh both the overall valence of the previous trial's outcome and the relative value of each option during decision-making.
机译:W.K. Estes经常拥护一种模型开发方法,该模型通过添加一个或多个自由参数来补充现有的心理机制来增强现有模型。遵循相同的方法,我们利用Estes(1950)自己的增强学习方程式来改善在我们最近的工作中使用的输赢-输赢(WSLS)模型的合理性。我们还通过增加其假设来改善基本强化学习(RL)模型的合理性。埃斯蒂斯还倡导了假设多个并发认知过程之间进行比较的模型。为此,我们开发了一个WSLS-RL模型,该模型假设人们倾向于在经历了相对较好(“获胜”)或较差(“失败”)的试验后仍选择相同的选项或转向其他选项。 )结果,并根据每种选择的相对期望值来调整停留或转移的趋势。 WSLS-RL模型的仿真与来自三个不同决策实验的数据的比较表明,WSLS-RL提供了决策行为的良好说明。我们的结果也支持这样的主张,即人类参与者既要权衡先前试验结果的总体效价,又要权衡决策过程中每种选择的相对价值。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号