New bounds on the price of bandit feedback for mistake-bounded online multiclass learning

Long Philip M.

首页> 外文期刊>Theoretical computer science >New bounds on the price of bandit feedback for mistake-bounded online multiclass learning

【24h】

New bounds on the price of bandit feedback for mistake-bounded online multiclass learning

机译：用于错误的在线多种单位学习的强盗反馈价格的新界限

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper is about two generalizations of the mistake bound model to online multiclass classification. In the standard model, the learner receives the correct classification at the end of each round, and in the bandit model, the learner only finds out whether its prediction was correct or not. For a set F of multiclass classifiers, let opt(std)(F) and opt(bandit)(F) be the optimal bounds for learning F according to these two models. We show that an

机译：本文是关于在线多字符分类的错误绑定模型的两个概括。在标准模型中，学习者在每轮末端接收正确的分类，并在强盗模型中，学习者只发现其预测是否正确。对于多条比例分类器的组F，让选择（STD）（F）和OPT（BANDIT）（F）是根据这两个模型学习F的最佳限制。我们展示了一个

著录项

来源
《Theoretical computer science》 |2020年第2020期|共5页
作者
Long Philip M.;
展开▼
作者单位

Google 1600 Amphitheatre Pkwy Mountain View CA 94043 USA;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类计算技术、计算机技术;
关键词
Online learning; Bandit feedback; Mistake-bound model; Learning theory;

机译：在线学习;强盗反馈;错误束缚模型;学习理论;

相似文献

外文文献
中文文献
专利

1. New bounds on the price of bandit feedback for mistake-bounded online multiclass learning [J] . Long Philip M. Theoretical computer science . 2020,第期

机译：用于错误的在线多种单位学习的强盗反馈价格的新界限
2. Online Multiclass Boosting with Bandit Feedback [J] . Daniel T. Zhang, Young Hun Jung, Ambuj Tewari JMLR: Workshop and Conference Proceedings . 2018,第12期

机译：带有强盗反馈的在线多类别提升
3. Online Multiclass Boosting with Bandit Feedback [J] . Daniel T. Zhang, Young Hun Jung, Ambuj Tewari JMLR: Workshop and Conference Proceedings . 2018,第2010期

机译：带有强盗反馈的在线多类别提升
4. Online Multiclass Learning with "Bandit" Feedback under a Confidence-Weighted Approach [C] . Chaoran Shi, Xiong Wang, Xiaohua Tian, IEEE Global Communications Conference . 2016

机译：置信加权方法下的“强盗”反馈在线多类别学习
5. Efficient Online Learning with Bandit Feedback [D] . Liu, Fang. 2020

机译：高效在线学习与强盗反馈
6. Receptivity and Feedback to the Online Endodontics Congress Concept as a Learning Option - An International Survey [O] . Joao Meirinhos, Mariana Domingos Pires, Rui Pereira da Costa, 2020

机译：接受性和反馈与在线endogontics国会概念作为学习选项 - 国际调查
7. A note on the price of bandit feedback for mistake-bounded online learning [O] . Jesse Geneson 2021

机译：关于误区在线学习的强盗反馈价格的说明

New bounds on the price of bandit feedback for mistake-bounded online multiclass learning

摘要

著录项

相似文献

相关主题

期刊订阅