首页> 外文会议>2017 International Conference on Machine Learning and Data Science >Modeling Choice Variation in Search Strategies with Multi-Armed Bandit Problems

【24h】

Modeling Choice Variation in Search Strategies with Multi-Armed Bandit Problems

机译：带有多武装强盗问题的搜索策略中的选择差异建模

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Prior research in decisions from experience (DFE) involving multi-armed bandit problems has used the sampling paradigm. In this paradigm, decision-makers search for information between multiple options before making a final consequential choice. Prior research in the sampling paradigm has accounted for information search and final choices using computational cognitive models. However, little is known on how cognitive models could account for final choices of participants with different exploration strategies in the presence or absence of an intermediate option. In this paper, we perform an individual-differences analysis and test the ability of computational models to explain final choices of participants with different exploration strategies in the absence or presence of an intermediate option. Specifically, we take an Instance-Based Learning (IBL) model, which relies on recency and frequency processes, and we calibrate this model to final choices of participants exhibiting more-switching (piecewise strategy) or less-switching (comprehensive strategy) between options in different problems. Also, a Natural Mean Heuristic (NMH) model, relying on frequency of experienced outcomes, is used as a baseline. Results revealed that both IBL and NMH models explained aggregate and individual choices better when participants followed piecewise strategy compared to the comprehensive strategy. Overall, the IBL model, calibrated to individual participants using a single set of parameters, performed better compared to the NMH model. We highlight the implications of our results for DFE research involving exploration before consequential decisions.

机译：先前关于涉及多武装匪徒问题的经验决策（DFE）的研究已使用采样范式。在这种范式中，决策者在做出最终结果选择之前先在多个选项之间搜索信息。抽样范式的先前研究已考虑了使用计算认知模型进行的信息搜索和最终选择。但是，关于认知模型如何在存在或不存在中间选择的情况下如何解释具有不同探索策略的参与者的最终选择知之甚少。在本文中，我们进行了个体差异分析，并测试了计算模型的能力，以解释在没有或没有中间选择的情况下采用不同探索策略的参与者的最终选择。具体而言，我们采用了基于实例的学习（IBL）模型，该模型依赖于新近度和频率过程，并将该模型校准为参与者的最终选择，这些参与者在选择之间表现出更大的切换（逐段策略）或更少的切换（全面策略）在不同的问题。此外，依赖于经验结果频率的自然平均启发式（NMH）模型被用作基准。结果表明，与综合策略相比，当参与者遵循分段策略时，IBL和NMH模型都可以更好地解释总体和个人选择。总体而言，与NMH模型相比，使用单个参数集对单个参与者进行了校准的IBL模型的性能更好。我们着重指出我们的结果对DFE研究的意义，包括在做出相应决定之前进行的勘探。

著录项

来源
《2017 International Conference on Machine Learning and Data Science 》|2017年|91-97|共7页
会议地点 Noida(IN)
作者
Neha Sharma; Varun Dutt;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类
关键词
Computational modeling; Switches; Search problems; Aggregates; Resource management; Investment; Probabilistic logic;

机译：计算建模;开关;搜索问题;聚集;资源管理;投资;概率逻辑;;

相似文献

外文文献
中文文献
专利

1. Truthful multi-armed bandit mechanisms for multi-slot sponsored search auctions [J] . Akash Das Sharma, Sujit Gujar, Y. Narahari Current Science: A Fortnightly Journal of Research . 2012 ,第9期

机译：真实的多臂强盗机制，用于多位置赞助的搜索拍卖
2. Truthful multi-armed bandit mechanisms for multi-slot sponsored search auctions [J] . Akash Das Sharma, Sujit Gujar, Y. Narahari Current Science: A Fortnightly Journal of Research . 2012 ,第9期

机译：真实的多臂强盗机制，用于多位置赞助的搜索拍卖
3. Variational inference for the multi-armed contextual bandit [J] . I?igo Urteaga, Chris Wiggins JMLR: Workshop and Conference Proceedings . 2018 ,第3期

机译：多臂上下文强盗的变分推理
4. Modeling Choice Variation in Search Strategies with Multi-Armed Bandit Problems [C] . Neha Sharma, Varun Dutt International Conference on Machine Learning and Data Science . 2017

机译：多武装强盗问题的搜索策略选择变化
5. Behavioral models of strategies in multi-armed bandit problems. [D] . Anderson, Christopher Madden. 2001

机译：多武装匪徒问题中策略的行为模型。
6. Multi-armed Bandit Models for the Optimal Design of Clinical Trials: Benefits and Challenges [O] . Sofía S. Villar, Jack Bowden, James Wason -1

机译：用于临床试验优化设计的多臂Bandit模型：好处和挑战
7. CCN interest forwarding strategy as Multi-Armed Bandit model with delays [O] . Avrachenkov Konstantin, Jacko Peter 2012

机译：具有延迟的多武装强盗模型的CCN兴趣转发策略

Modeling Choice Variation in Search Strategies with Multi-Armed Bandit Problems

摘要

著录项

相似文献

相关主题

期刊订阅