首页> 外文会议>Annual conference on Neural Information Processing Systems >Multiple-Play Bandits in the Position-Based Model
【24h】

Multiple-Play Bandits in the Position-Based Model

机译:基于位置的模型中的多重游戏强盗

获取原文

摘要

Sequentially learning to place items in multi-position displays or lists is a task that can be cast into the multiple-play semi-bandit setting. However, a major concern in this context is when the system cannot decide whether the user feedback for each item is actually exploitable. Indeed, much of the content may have been simply ignored by the user. The present work proposes to exploit available information regarding the display position bias under the so-called Position-based click model (PBM). We first discuss how this model differs from the Cascade model and its variants considered in several recent works on multiple-play bandits. We then provide a novel regret lower bound for this model as well as computationally efficient algorithms that display good empirical and theoretical performance.
机译:顺序学习将项目放置在多位置显示或列表中是一项任务,可以将其转换为多重播放半强盗设置。但是,在这种情况下,主要的担忧是系统无法确定是否可以实际利用每个项目的用户反馈。实际上,许多内容可能已经被用户简单地忽略了。本工作提出在所谓的基于位置的点击模型(PBM)下利用关于显示位置偏差的可用信息。我们首先讨论该模型与Cascade模型的区别,以及最近在多部老虎机上进行的几项研究中考虑的变体。然后,我们为此模型提供了一个新颖的遗憾下限,以及显示出良好的经验和理论性能的高效计算算法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号