首页> 外文期刊>PLoS One >Computational modeling of choice-induced preference change: A Reinforcement-Learning-based approach
【24h】

Computational modeling of choice-induced preference change: A Reinforcement-Learning-based approach

机译:选择诱导的偏好变化的计算模型:基于加强学习的方法

获取原文
获取外文期刊封面目录资料

摘要

The value learning process has been investigated using decision-making tasks with a correct answer specified by the external environment (externally guided decision-making, EDM). In EDM, people are required to adjust their choices based on feedback, and the learning process is generally explained by the reinforcement learning (RL) model. In addition to EDM, value is learned through internally guided decision-making (IDM), in which no correct answer defined by external circumstances is available, such as preference judgment. In IDM, it has been believed that the value of the chosen item is increased and that of the rejected item is decreased (choice-induced preference change; CIPC). An RL-based model called the choice-based learning (CBL) model had been proposed to describe CIPC, in which the values of chosen and/or rejected items are updated as if own choice were the correct answer. However, the validity of the CBL model has not been confirmed by fitting the model to IDM behavioral data. The present study aims to examine the CBL model in IDM. We conducted simulations, a preference judgment task for novel contour shapes, and applied computational model analyses to the behavioral data. The results showed that the CBL model with both the chosen and rejected value’s updated were a good fit for the IDM behavioral data compared to the other candidate models. Although previous studies using subjective preference ratings had repeatedly reported changes only in one of the values of either the chosen or rejected items, we demonstrated for the first time both items’ value changes were based solely on IDM choice behavioral data with computational model analyses.
机译:使用具有由外部环境指定的正确答案的决策任务(外部导向决策,EDM)进行了调查了价值学习过程。在EDM中,人们需要根据反馈来调整其选择,并且通常通过增强学习(RL)模型来解释学习过程。除了EDM之外,通过内部导向的决策(IDM)学习价值,其中没有通过外部环境定义的正确答案,例如偏好判断。在IDM中,已经据信,所选项目的值增加,拒绝物品的值减少(选择诱导的偏好变化; CIPC)。已经提出了一种称为基于选择的学习(CBL)模型的基于RL的模型来描述CIPC,其中所选和/或拒绝物品的值被更新,就像自己的选择是正确的答案一样。然而,通过将模型拟合到IDM行为数据来确认CBL模型的有效性。本研究旨在检查IDM中的CBL模型。我们进行了模拟,新型轮廓形状的偏好判断任务,以及应用计算模型对行为数据分析。结果表明,与其他候选模型相比,具有所选和拒绝值更新的CBL模型对IDM行为数据有良好拟合。尽管以前使用主观偏好额定值的研究仅在所选或被拒绝物品的一个值中重复报告的变化,但我们首次展示两个项目的价值变化是完全基于IDM选择具有计算模型分析的行为数据。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号