首页> 外文期刊>Automatica >Multiple stopping time POMDPs: Structural results & application in interactive advertising on social media
【24h】

Multiple stopping time POMDPs: Structural results & application in interactive advertising on social media

机译:多次停止时间POMDPS:结构& 在社交媒体上互动广告的应用

获取原文
获取原文并翻译 | 示例
           

摘要

This paper considers a multiple stopping time problem for a Markov chain observed in noise, where a decision maker chooses at mostLstopping times to maximize a cumulative objective. We formulate the problem as a Partially Observed Markov Decision Process (POMDP) and derive structural results for the optimal multiple stopping policy. The main results are as follows: (i) The optimal multiple stopping policy is shown to be characterized by threshold curvesΓl, forl=1,…,L, in the unit simplex of Bayesian Posteriors. (ii) The stopping setsSl(defined by the threshold curvesΓl) are shown to exhibit the following nested structureSl?1?Sl. (iii) The optimal cumulative reward is shown to be monotone with respect to the copositive ordering of the transition matrix. (iv) A stochastic gradient algorithm is provided for estimating linear threshold policies by exploiting the structural results. These linear threshold policies approximate the threshold curvesΓl, and share the monotone structure of the optimal multiple stopping policy. (v) Application of the multiple stopping framework to interactively schedule advertisements in live online social media. It is shown that advertisement scheduling using multiple stopping performs significantly better than currently used methods.
机译:本文考虑了在噪声中观察到的马尔可夫链的多次停止时间问题,其中决策者在最典型的时间选择以最大化累积目标。我们将问题制定为部分观察到的马尔可夫决策过程(POMDP),并导出最佳多次停止策略的结构结果。主要结果如下:(i)显示最佳的多次停止策略以阈值曲线γ1,forl = 1,...,l,在贝叶斯海底的单位单位。 (ii)显示停止SETSL(由阈值曲线γ1定义)显示出下列嵌套结构1?1?SL。 (iii)显示最佳累积奖励是关于转换矩阵的二极管排序的单调。 (iv)提供了一种随机梯度算法,用于通过利用结构结果来估计线性阈值策略。这些线性阈值策略近似阈值曲线γ1,并共享最佳多个停止策略的单调结构。 (v)在现场在线社交媒体中互动安排广告的多次停止框架的应用。结果表明,使用多个停止的广告调度明显优于当前使用的方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号