首页> 外文期刊>Journal of Mathematical Psychology >Optimal experimental design for a class of bandit problems
【24h】

Optimal experimental design for a class of bandit problems

机译:一类强盗问题的最佳实验设计

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

Bandit problems are a class of sequential decision-making problems that are useful for studying human decision-making, especially in the context of understanding how people balance exploration with exploitation. A major goal of measuring people's behavior using bandit problems is to distinguish between competing models of their decision-making. This raises a question of experimental design: How should a set of bandit problems be designed to maximize the ability to discriminate between models? We apply a previously developed design optimization framework to the problem of finding good bandit problem experiments, and develop computational sampling schemes for implementing the approach. We demonstrate the approach in a number of simple cases, varying the priors on parameters for some standard models. We also demonstrate the approach using empirical priors, inferred by hierarchical Bayesian analysis from human data, and show that optimally designed bandit problems significantly enhance the ability to discriminate between competing models.
机译:土匪问题是一类顺序决策问题,对于研究人类决策特别是在了解人们如何平衡勘探与开发之间的关系中非常有用。使用强盗问题来衡量人们的行为的主要目标是区分决策的竞争模型。这就提出了一个实验设计问题:应该如何设计一组匪徒问题以最大程度地区分模型?我们将先前开发的设计优化框架应用于发现良好的匪徒问题实验的问题,并开发用于实现该方法的计算抽样方案。我们在许多简单情况下演示了该方法,并更改了一些标准模型的参数先验。我们还演示了使用经验先验方法,该方法是根据人类数据进行的分级贝叶斯分析得出的,并表明优化设计的匪徒问题显着增强了区分竞争模型的能力。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号