Incentivising Exploration and Recommendations for Contextual Bandits with Payments

机译：通货方式的激励探索和建议

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

We propose a contextual bandit based model to capture the learning and social welfare goals of a web platform in the presence of myopic users. By using payments to incentivize these agents to explore different items/recommendations, we show how the platform can learn the inherent attributes of items and achieve a sublinear regret while maximizing cumulative social welfare. We also calculate theoretical bounds on the cumulative costs of incentivization to the platform. Unlike previous works in this domain, we consider contexts to be completely adversarial, and the behavior of the adversary is unknown to the platform. Our approach can improve various engagement metrics of users on e-commerce stores, recommendation engines and matching platforms.

机译：我们提出了一种基于语境的匪盗模型，可以在近视用户的存在下捕获Web平台的学习和社会福利目标。通过使用付款来激励这些代理商来探索不同的项目/建议，我们展示了平台如何学习物品的固有属性，并在最大化累积社会福利时实现索姆林的遗憾。我们还计算了对平台激励激励成本的理论界。与以前的工作不同，我们认为上下文是完全对抗的，并且对手的行为是未知的平台。我们的方法可以改善电子商务商店，推荐发动机和匹配平台上的用户的各种参与度量。

著录项

来源
《European Conference on Multi-Agent Systems;International Conference on Agreement Technologies》|2020年|159-170|共12页
会议地点
作者
Priyank Agrawal; Theja Tulabandhula;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Multi agent learning; Contextual bandit; Incentivizing exploration;

机译：多代理学习;情境匪徒;激励探索;

相似文献

外文文献
中文文献
专利

1. Contextual Bandit Approach-based Recommendation System for Personalized Web-based Services [J] . Pilani Akshay, Mathur Kritagya, Agrawal Himanshu, Applied Artificial Intelligence . 2021,第5a8期

机译：基于语调的基于Web的服务的方法 - 基于Birt方法的推荐系统
2. Contextual Bandits With Hidden Features to Online Recommendation via Sparse Interactions [J] . Shangdong Yang, Hao Wang, Chenyu Zhang, IEEE intelligent systems . 2020,第5期

机译：通过稀疏交互与在线推荐的隐藏功能的上下文匪徒
3. Safe Exploration for Optimizing Contextual Bandits [J] . ROLF JAGERMAN, ILYA MARKOV, MAARTEN DE RIJKE ACM Transactions on Information Systems . 2020,第3期

机译：优化上下文匪徒的安全探索
4. Contextual Bandit Learning for Activity-Aware Things-of-Interest Recommendation in an Assisted Living Environment [C] . May S. Altulayan, Chaoran Huang, Lina Yao, Australasian Database Conference . 2021

机译：在辅助生活环境中的活动意识到活动意见的情境匪徒学习
5. Adaptive Preference Learning with Bandit Feedback: Information Filtering, Dueling Bandits and Incentivizing Exploration [D] . Chen, Bangrui. 2017

机译：带有土匪反馈的自适应偏好学习：信息过滤，决斗土匪和激励探索
6. Are Pharmaceutical Company Payments Incentivising Malpractice in Japanese Physicians? [O] . Yurie Kobashi, Makoto Watanabe, Hideaki Kimura, 2019

机译：制药公司付款是否会刺激日本医师的医疗事故？
7. Incentivising Exploration and Recommendations for Contextual Bandits with Payments [O] . Priyank Agrawal, Theja Tulabandhula 2020

机译：宣传探索和支付上下文匪徒的建议

Incentivising Exploration and Recommendations for Contextual Bandits with Payments

摘要

著录项

相似文献

相关主题

期刊订阅