Per-Round Knapsack-Constrained Linear Submodular Bandits

Baosheng Yu; Meng Fang; Dacheng Tao

首页> 外文期刊>Neural computation >Per-Round Knapsack-Constrained Linear Submodular Bandits

【24h】

Per-Round Knapsack-Constrained Linear Submodular Bandits

机译：每轮背负背包约束的线性次模匪

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Linear submodular bandits has been proven to be effective in solving the diversification and feature-based exploration problem in information retrieval systems. Considering there is inevitably a budget constraint in many web-based applications, such as news article recommendations and online advertising, we study the problem of diversification under a budget constraint in a bandit setting. We first introduce a budget constraint to each exploration step of linear submodular bandits as a new problem, which we call per-round knapsack-constrained linear submodular bandits. We then define an -approximation unit-cost regret considering that the submodular function maximization is NP-hard. To solve this new problem, we propose two greedy algorithms based on a modified UCB rule. We prove these two algorithms with different regret bounds and computational complexities. Inspired by the lazy evaluation process in submodular function maximization, we also prove that a modified lazy evaluation process can be used to accelerate our algorithms without losing their theoretical guarantee. We conduct a number of experiments, and the experimental results confirm our theoretical analyses.

机译：事实证明，线性次模块化强盗可有效解决信息检索系统中的多样化和基于特征的探索问题。考虑到在许多基于Web的应用程序中不可避免地存在预算约束，例如新闻文章推荐和在线广告，我们研究了在强盗环境下预算约束下的多样化问题。首先，我们将预算约束引入到线性子模块化强盗的每个探索步骤中，作为一个新问题，我们将其称为每轮背包约束线性子模块化强盗。然后，考虑到子模函数最大化是NP-hard，我们定义一个近似单位成本后悔。为了解决这个新问题，我们提出了两种基于修改后的UCB规则的贪婪算法。我们证明了这两种算法具有不同的后悔界限和计算复杂性。受子模块函数最大化中的惰性评估过程的启发，我们还证明了改进的惰性评估过程可用于加速算法，而不会失去其理论保证。我们进行了许多实验，实验结果证实了我们的理论分析。

著录项

来源
《Neural computation》 |2016年第12期|2757-2789|共33页
作者
Baosheng Yu; Meng Fang; Dacheng Tao;
展开▼
作者单位

Centre for Artificial Intelligence Faculty of Engineering and Information Technology University of Technology Sydney Sydney NSW 2007 Australia baosheng.yu@student.uts.edu.au;

Department of Computing and Information Systems University of Melbourne Victoria 3010 Australia meng.fang@unimelb.edu.au;

Centre for Artificial Intelligence Faculty of Engineering and Information Technology University of Technology Sydney Sydney NSW 2007 Australia Dacheng.Tao@uts.edu.au;

展开▼
收录信息美国《科学引文索引》(SCI);美国《化学文摘》(CA);
原文格式 PDF
正文语种 eng
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Submodular Bandit Problem Under Multiple Constraints [J] . Sho Takemori, Masahiro Sato, Takashi Sonoda, JMLR: Workshop and Conference Proceedings . 2020,第2010期

机译：多个约束下的子模块匪徒问题
2. Approximation algorithms for submodular vertex cover problems with linear/submodular penalties using primal-dual technique [J] . Xu Dachuan, Wang Fengmin, Du Donglei, Theoretical computer science . 2016,第Null期

机译：使用原始对偶技术的线性/次模罚分的次模顶点覆盖问题的近似算法
3. PRIMAL-DUAL APPROXIMATION ALGORITHMS FOR SUBMODULAR COST SET COVER PROBLEMS WITH LINEAR/SUBMODULAR PENALTIES [J] . FENGMIN WANG, DACHUAN XU, DONGLEI DU, Numerical Algebra, Control and Optimization . 2015,第2期

机译：具有线性/亚模块惩罚性的亚模块成本集覆盖问题的本原逼近算法
4. Cascading Linear Submodular Bandits: Accounting for Position Bias and Diversity in Online Learning to Rank [C] . Gaurush Hiranandani, Harvineet Singh, Prakhar Gupta, Conference on Uncertainty in Artificial Intelligence . 2019

机译：级联线性子模块匪徒：在线学习中的位置偏见和多样性核算
5. Graph Cuts, Sum-of-Submodular Flow, and Linear Programming: Effective Inference in Higher-Order Markov Random Fields. [D] . Fix, Alexander. 2017

机译：图割，亚模总和和线性规划：高阶马尔可夫随机场中的有效推论。
6. Submodular Maximization via Gradient Ascent: The Case of Deep Submodular Functions [O] . Wenruo Bai, William S Noble, Jeff A. Bilmes -1

机译：通过梯度上升的亚模最大化：深亚模函数的情况
7. Beyond pointwise submodularity: Non-monotone adaptive submodular maximization in linear time [O] . Shaojie Tang 2021

机译：超越尖子骨折：线性时间的非单调自适应子模块化最大化

Per-Round Knapsack-Constrained Linear Submodular Bandits

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅