首页> 外文会议>European Conference on Multi-Agent Systems;International Conference on Agreement Technologies >Incentivising Exploration and Recommendations for Contextual Bandits with Payments
【24h】

Incentivising Exploration and Recommendations for Contextual Bandits with Payments

机译:通货方式的激励探索和建议

获取原文

摘要

We propose a contextual bandit based model to capture the learning and social welfare goals of a web platform in the presence of myopic users. By using payments to incentivize these agents to explore different items/recommendations, we show how the platform can learn the inherent attributes of items and achieve a sublinear regret while maximizing cumulative social welfare. We also calculate theoretical bounds on the cumulative costs of incentivization to the platform. Unlike previous works in this domain, we consider contexts to be completely adversarial, and the behavior of the adversary is unknown to the platform. Our approach can improve various engagement metrics of users on e-commerce stores, recommendation engines and matching platforms.
机译:我们提出了一种基于语境的匪盗模型,可以在近视用户的存在下捕获Web平台的学习和社会福利目标。 通过使用付款来激励这些代理商来探索不同的项目/建议,我们展示了平台如何学习物品的固有属性,并在最大化累积社会福利时实现索姆林的遗憾。 我们还计算了对平台激励激励成本的理论界。 与以前的工作不同,我们认为上下文是完全对抗的,并且对手的行为是未知的平台。 我们的方法可以改善电子商务商店,推荐发动机和匹配平台上的用户的各种参与度量。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号