...
首页> 外文期刊>Mobile networks & applications >Collaborative Thompson Sampling
【24h】

Collaborative Thompson Sampling

机译:合作汤普森抽样

获取原文
获取原文并翻译 | 示例

摘要

Thompson sampling is one of the most effective strategies to balance exploration-exploitation trade-off. It has been applied in a variety of domains and achieved remarkable success. Thompson sampling makes decisions in a noisy but stationary environment by accumulating uncertain information over time to improve prediction accuracy. In highly dynamic domains, however, the environment undergoes frequent and unpredictable changes. Making decisions in such an environment should rely on current information. Therefore, standard Thompson sampling may perform poorly in these domains. Here we present collaborative Thompson sampling to apply the exploration-exploitation strategy to highly dynamic settings. The algorithm takes collaborative effects into account by dynamically clustering users into groups, and the feedback of all users in the same group will help to estimate the expected reward in the current context to find the optimal choice. Incorporating collaborative effects into Thompson sampling allows to capture real-time changes of the environment and adjust decision making strategy accordingly. We compare our algorithm with standard Thompson sampling algorithms on two real-world datasets. Our algorithm shows accelerated convergence and improved prediction performance in collaborative environments. We also provide regret analyses of our algorithm in both contextual and non-contextual settings.
机译:汤普森抽样是平衡勘探开发权衡最有效的策略之一。它已应用于各种域,取得了显着的成功。汤普森采样通过随着时间的推移累积不确定的信息来提高预测准确性,在嘈杂但静止环境中做出决定。然而,在高度动态域中,环境经历频繁和不可预测的变化。在这种环境中做出决定应依赖当前信息。因此,标准汤普森采样可以在这些域中表现不佳。在这里,我们提出了合作汤普森采样,将探索开发策略应用于高度动态的设置。该算法通过将用户动态聚类为组来考虑协作效果,同一组中所有用户的反馈将有助于估计当前上下文中的预期奖励以找到最佳选择。将协作效果纳入汤普森采样允许捕获环境的实时变化,并相应地调整决策策略。我们将算法与标准汤普森采样算法进行比较两个真实世界数据集。我们的算法显示了加速的收敛性和改进的协作环境中的预测性能。我们还在上下文和非上下文设置中提供了我们的算法的遗憾分析。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号