首页> 外文期刊>INFORMS journal on computing >Differentially Private and Budget-Limited Bandit Learning over Matroids
【24h】

Differentially Private and Budget-Limited Bandit Learning over Matroids

机译:差异私有和预算有限的匪徒在麦芽糖上学习

获取原文
获取原文并翻译 | 示例
       

摘要

We propose the first budget-limited multi-armed bandit (BMAB) algorithm subject to a union of matroid constraints in arm pulling, while at the same time achieving differential privacy. Our model generalizes the arm-pulling models studied in prior BMAB schemes, and it can be used to address many practical problems such as network backbone construction and dynamic pricing in crowdsourcing. We handle the exploitation versus exploration tradeoff in our BMAB problem by exploiting the combinatorial structures of matroids, and reduce the searching complexity of arm selection based on a divide-and-conquer approach. Our algorithm achieves a uniform logarithmic regret bound with respect to B and ε-differential privacy, where B is the budget for pulling the arms with random costs. Without differential privacy, our algorithm achieves a uniform logarithmic regret bound with respect to B, which advances the asymptotic regret bounds achieved by prior BMAB algorithms. We performed side-by-side comparisons with prior schemes in our experiments. Experimental results show that our purely-combinatorial algorithm not only achieves significantly better regret performance, but also is more than 20 times faster than prior BMAB schemes, which use time-consuming LP-solving techniques.
机译:我们提出了第一个预算有限的多武装强盗(BMAB)算法,其在ARM拉动中的Matroid约束联盟,同时实现差异隐私。我们的模型概括了先前BMAB方案研究的扶手模型,可用于解决许多实际问题,如网络骨干结构和众包中的动态定价。我们通过利用Matroids的组合结构来处理BMAB问题的剥削与勘探权衡,并根据分行和征服方法减少ARM选择的搜索复杂性。我们的算法达到了关于B和ε-差分隐私的统一对数遗憾,其中B是以随机成本拉动臂的预算。在没有差异隐私的情况下,我们的算法达到了相对于B的统一对数遗憾,这导致了先前BMAB算法实现的渐近遗憾界限。我们在我们的实验中与现有方案进行了并排比较。实验结果表明,我们的纯组合算法不仅达到了明显更好的遗憾性能,而且比现有BMAB方案快20倍以上,使用耗时的LP求解技术。

著录项

  • 来源
    《INFORMS journal on computing》 |2020年第3期|790-804|共15页
  • 作者单位

    School of Computer Science and Technology/Suzhou Institute for Advanced Study University of Science and Technology of China Hefei Anhui 23006 People's Republic of China;

    School of Computer Science and Technology/Suzhou Institute for Advanced Study University of Science and Technology of China Hefei Anhui 23006 People's Republic of China;

    Department of Computer Science and Engineering Michigan State University East Lansing Michigan 48824;

    Naveen Jindal School of Management University of Texas at Dallas Richardson Texas 75080;

    School of Computer Science and Technology Soochow University Suzhou 215006 People's Republic of China;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    multi-armed bandit; matroid; budget;

    机译:多武装匪徒;matroid;预算;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号