A numerical analysis of allocation strategies for the multi-armed bandit problem under delayed rewards conditions in digital campaign management

Martin Miguel; Jimenez-Martin Antonio; Mateos Alfonso

首页> 外文期刊>Neurocomputing >A numerical analysis of allocation strategies for the multi-armed bandit problem under delayed rewards conditions in digital campaign management

【24h】

A numerical analysis of allocation strategies for the multi-armed bandit problem under delayed rewards conditions in digital campaign management

机译：数字战役管理中延迟奖励条件下多臂匪问题分配策略的数值分析

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

In this paper, we analyze the most representative allocation strategies to deal with the multi-armed bandit problem in a context with delayed rewards by means of a numerical study based on a discrete event simulation. The scenario that we address is a digital marketing content recommendation system, called campaign management, used by marketers to create specific digital content that can be issued or configured for viewing by certain population segments according to a series of business variables, user profile or behavior. Both batch mode and online update architectures are considered for feedback from the different contents displayed to users. The results show that possibilistic reward (PR) methods outperform other allocation strategies in this scenario with delayed rewards. (C) 2019 Elsevier B.V. All rights reserved.

机译：在本文中，我们通过基于离散事件模拟的数值研究，分析了在具有延迟奖励的情况下解决多臂匪问题的最具代表性的分配策略。我们要解决的方案是一个称为营销活动管理的数字营销内容推荐系统，市场营销人员使用该系统来创建特定的数字内容，该数字内容可以根据一系列业务变量，用户个人资料或行为发布或配置为某些人群细分查看。批处理模式和联机更新体系结构都将考虑从显示给用户的不同内容中获取反馈。结果表明，在这种情况下，延迟奖励可能会导致奖励（PR）优于其他分配策略。（C）2019 Elsevier B.V.保留所有权利。

著录项

来源
《Neurocomputing》 |2019年第21期|99-113|共15页
作者
Martin Miguel; Jimenez-Martin Antonio; Mateos Alfonso;
展开▼
作者单位

Univ Politecn Madrid Dept Inteligencia Artificial Campus Montegancedo S-N Boadilla Del Monte Madri 28660 Spain;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类
关键词
Multi-armed bandit problem; Delayed reward; Numerical study; Digital campaign management;

机译：多臂强盗问题;延迟奖励;数值研究;数字营销活动管理;

相似文献

外文文献
中文文献
专利

1. Randomized allocation with nonparametric estimation for contextual multi-armed bandits with delayed rewards [J] . Arya Sakshi, Yang Yuhong Statistics & Probability Letters . 2020,第1期

机译：随机分配与延迟奖励的上下文多武装匪徒的非参数分配
2. Best arm identification in multi-armed bandits with delayed feedback [J] . Aditya Grover, Todor Markov, Peter Attia, JMLR: Workshop and Conference Proceedings . 2018,第3期

机译：具有延迟反馈的多臂匪的最佳臂识别
3. Priority index heuristic for multi-armed bandit problems with set-up costs and/or set-up time delays [J] . F. DUSONCHET, M.-O. HONGLER International Journal of Computer Integrated Manufacturing . 2006,第3期

机译：具有设置成本和/或设置时间延迟的多臂匪问题的优先级指标启发式
4. The Multi-Armed Bandit Problem under Delayed Rewards Conditions in Digital Campaign Management [C] . M. Martín, A. Jiménez-Martín, A. Mateos International Conference on Control, Decision and Information Technologies . 2019

机译：延迟奖励条件下的数字战役管理中的多武装强盗问题
5. Behavioral models of strategies in multi-armed bandit problems. [D] . Anderson, Christopher Madden. 2001

机译：多武装匪徒问题中策略的行为模型。
6. An Analysis of the Value of Information When Exploring Stochastic Discrete Multi-Armed Bandits [O] . Isaac J. Sledge, José C. Príncipe 2018

机译：探索随机离散多武装匪徒信息的价值分析
7. The Multi-Armed Bandit Problem under Delayed Rewards Conditions in Digital Campaign Management [O] . M. Martin, A. Jimenez-Martin, A. Mateos 2019

机译：数字竞选管理中延迟奖励条件下的多武装强盗问题

A numerical analysis of allocation strategies for the multi-armed bandit problem under delayed rewards conditions in digital campaign management

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅