首页> 外文会议>International Conference on Machine Learning >Dynamic Learning with Frequent New Product Launches: A Sequential Multinomial Logit Bandit Problem
【24h】

Dynamic Learning with Frequent New Product Launches: A Sequential Multinomial Logit Bandit Problem

机译:使用频繁的新产品发布动态学习:一个顺序多项式Lo​​git Bainit问题

获取原文

摘要

Motivated by the phenomenon that companies introduce new products to keep abreast with customers' rapidly changing tastes, we consider a novel online learning setting where a profit-maximizing seller needs to learn customers' preferences through offering recommendations, which may contain existing products and new products that are launched in the middle of a selling period. We propose a sequential multinomial logit (SMNL) model to characterize customers' behavior when product recommendations are presented in tiers. For the offline version with known customers' preferences, we propose a polynomial-time algorithm and characterize the properties of the optimal tiered product recommendation. For the online problem, we propose a learning algorithm and quantify its regret bound. Moreover, we extend the setting to incorporate a constraint which ensures every new product is learned to a given accuracy. Our results demonstrate the tier structure can be used to mitigate the risks associated with learning new products.
机译:通过公司推出新产品与客户迅速变化的品味及时了解的现象,我们考虑了一个新的在线学习环境,其中一个利润最大化的卖方需要通过提供建议,以提供现有产品和新产品的建议来学习客户的偏好这是在卖空期间发起的。我们提出了一个顺序多项式Lo​​git(SMNL)模型,以表征客户的行为,当产品建议中呈现在层中。对于具有已知客户偏好的脱机版本,我们提出了一种多项式算法,并表征了最佳分层产品推荐的属性。对于在线问题,我们提出了一种学习算法并量化其遗憾。此外,我们将该设置扩展以结合约束,该约束确保每个新产品学习到给定的准确性。我们的结果证明了层结构可用于减轻与学习新产品相关的风险。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号