首页> 外文期刊>Neurocomputing >A numerical analysis of allocation strategies for the multi-armed bandit problem under delayed rewards conditions in digital campaign management
【24h】

A numerical analysis of allocation strategies for the multi-armed bandit problem under delayed rewards conditions in digital campaign management

机译:数字战役管理中延迟奖励条件下多臂匪问题分配策略的数值分析

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

In this paper, we analyze the most representative allocation strategies to deal with the multi-armed bandit problem in a context with delayed rewards by means of a numerical study based on a discrete event simulation. The scenario that we address is a digital marketing content recommendation system, called campaign management, used by marketers to create specific digital content that can be issued or configured for viewing by certain population segments according to a series of business variables, user profile or behavior. Both batch mode and online update architectures are considered for feedback from the different contents displayed to users. The results show that possibilistic reward (PR) methods outperform other allocation strategies in this scenario with delayed rewards. (C) 2019 Elsevier B.V. All rights reserved.
机译:在本文中,我们通过基于离散事件模拟的数值研究,分析了在具有延迟奖励的情况下解决多臂匪问题的最具代表性的分配策略。我们要解决的方案是一个称为营销活动管理的数字营销内容推荐系统,市场营销人员使用该系统来创建特定的数字内容,该数字内容可以根据一系列业务变量,用户个人资料或行为发布或配置为某些人群细分查看。批处理模式和联机更新体系结构都将考虑从显示给用户的不同内容中获取反馈。结果表明,在这种情况下,延迟奖励可能会导致奖励(PR)优于其他分配策略。 (C)2019 Elsevier B.V.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号