Online Multiclass Boosting with Bandit Feedback

Daniel T. Zhang; Young Hun Jung; Ambuj Tewari

Abstract

We present online boosting algorithms for multiclass classification with bandit feedback, where the learner only receives feedback about the correctness of its prediction. We propose an unbiased estimate of the loss using a randomized prediction, allowing the model to update its weak learners with limited information. Using the unbiased estimate, we extend two full-information boosting algorithms (Jung et al., 2017) to the bandit setting. We prove that the asymptotic error bounds of the bandit algorithms exactly match those of their full-information counterparts. The cost of restricted feedback is reflected in the larger sample complexity. Experimental results also support our theoretical findings, and the performance of the proposed models is comparable to that of an existing bandit boosting algorithm, which is restricted to binary weak learners.
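
The idea of an unbiased loss estimate built from a randomized prediction can be illustrated with a generic importance-weighted estimator. The sketch below is not taken from the paper: the function names, the uniform exploration rate `rho`, and the choice of the 0-1 loss vector are all assumptions made for illustration. The learner mixes its greedy label with uniform exploration, observes only whether the sampled label was correct, and reweights the observed loss by the inverse sampling probability so that the estimate equals the full 0-1 loss vector in expectation.

```python
import random


def exploration_distribution(scores, rho):
    """Mix the greedy label with uniform exploration (hypothetical scheme).

    Every label gets probability at least rho / k, which keeps the
    importance weights below finite.
    """
    k = len(scores)
    greedy = max(range(k), key=lambda i: scores[i])
    probs = [rho / k] * k
    probs[greedy] += 1.0 - rho
    return probs


def sample_prediction(probs):
    """Draw the randomized prediction from the exploration distribution."""
    return random.choices(range(len(probs)), weights=probs)[0]


def estimate_loss(probs, predicted, correct):
    """Importance-weighted estimate of the 0-1 loss vector.

    Only one bit of feedback is used: whether `predicted` was correct.
    """
    estimate = [0.0] * len(probs)
    if not correct:
        estimate[predicted] = 1.0 / probs[predicted]
    return estimate


def expected_estimate(probs, true_label):
    """Exact expectation of the estimate over the random prediction."""
    k = len(probs)
    total = [0.0] * k
    for predicted in range(k):
        estimate = estimate_loss(probs, predicted, predicted == true_label)
        for i in range(k):
            total[i] += probs[predicted] * estimate[i]
    return total


probs = exploration_distribution([0.2, 1.5, -0.3], rho=0.3)
predicted = sample_prediction(probs)
print(estimate_loss(probs, predicted, predicted == 2))
# Up to floating-point error, the expectation is 1 for every wrong label
# and 0 for the true label, i.e. the full-information 0-1 loss vector.
print(expected_estimate(probs, true_label=2))
```

A smaller `rho` makes the inverse probabilities larger and the estimate noisier, which is one intuitive way to see why restricted feedback would cost more samples.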

Bibliographic details

Source: JMLR: Workshop and Conference Proceedings, 2018, issue 12, 9 pages
Authors: Daniel T. Zhang; Young Hun Jung; Ambuj Tewari
Original format: PDF
Chinese Library Classification: Artificial intelligence theory
