首页> 外文会议>2017 International Conference on Machine Learning and Data Science >Modeling Choice Variation in Search Strategies with Multi-Armed Bandit Problems
【24h】

Modeling Choice Variation in Search Strategies with Multi-Armed Bandit Problems

机译:带有多武装强盗问题的搜索策略中的选择差异建模

获取原文
获取原文并翻译 | 示例

摘要

Prior research in decisions from experience (DFE) involving multi-armed bandit problems has used the sampling paradigm. In this paradigm, decision-makers search for information between multiple options before making a final consequential choice. Prior research in the sampling paradigm has accounted for information search and final choices using computational cognitive models. However, little is known on how cognitive models could account for final choices of participants with different exploration strategies in the presence or absence of an intermediate option. In this paper, we perform an individual-differences analysis and test the ability of computational models to explain final choices of participants with different exploration strategies in the absence or presence of an intermediate option. Specifically, we take an Instance-Based Learning (IBL) model, which relies on recency and frequency processes, and we calibrate this model to final choices of participants exhibiting more-switching (piecewise strategy) or less-switching (comprehensive strategy) between options in different problems. Also, a Natural Mean Heuristic (NMH) model, relying on frequency of experienced outcomes, is used as a baseline. Results revealed that both IBL and NMH models explained aggregate and individual choices better when participants followed piecewise strategy compared to the comprehensive strategy. Overall, the IBL model, calibrated to individual participants using a single set of parameters, performed better compared to the NMH model. We highlight the implications of our results for DFE research involving exploration before consequential decisions.
机译:先前关于涉及多武装匪徒问题的经验决策(DFE)的研究已使用采样范式。在这种范式中,决策者在做出最终结果选择之前先在多个选项之间搜索信息。抽样范式的先前研究已考虑了使用计算认知模型进行的信息搜索和最终选择。但是,关于认知模型如何在存在或不存在中间选择的情况下如何解释具有不同探索策略的参与者的最终选择知之甚少。在本文中,我们进行了个体差异分析,并测试了计算模型的能力,以解释在没有或没有中间选择的情况下采用不同探索策略的参与者的最终选择。具体而言,我们采用了基于实例的学习(IBL)模型,该模型依赖于新近度和频率过程,并将该模型校准为参与者的最终选择,这些参与者在选择之间表现出更大的切换(逐段策略)或更少的切换(全面策略)在不同的问题。此外,依赖于经验结果频率的自然平均启发式(NMH)模型被用作基准。结果表明,与综合策略相比,当参与者遵循分段策略时,IBL和NMH模型都可以更好地解释总体和个人选择。总体而言,与NMH模型相比,使用单个参数集对单个参与者进行了校准的IBL模型的性能更好。我们着重指出我们的结果对DFE研究的意义,包括在做出相应决定之前进行的勘探。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号