Algorithmic Chaining and the Role of Partial Feedback in Online Nonparametric Learning

Nicolò Cesa-Bianchi; Pierre Gaillard; Claudio Gentile; Sébastien Gerchinovitz

首页> 外文期刊>JMLR: Workshop and Conference Proceedings >Algorithmic Chaining and the Role of Partial Feedback in Online Nonparametric Learning

【24h】

Algorithmic Chaining and the Role of Partial Feedback in Online Nonparametric Learning

机译：算法链接和部分反馈在在线非参数学习中的作用

获取原文

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

We investigate contextual online learning with nonparametric (Lipschitz) comparison classes under different assumptions on losses and feedback information. For full information feedback and Lipschitz losses, we design the first explicit algorithm achieving the minimax regret rate (up to log factors). In a partial feedback model motivated by second-price auctions, we obtain algorithms for Lipschitz and semi-Lipschitz losses with regret bounds improving on the known bounds for standard bandit feedback. Our analysis combines novel results for contextual second-price auctions with a novel algorithmic approach based on chaining. When the context space is Euclidean, our chaining approach is efficient and delivers an even better regret bound.

机译：我们在损失和反馈信息的不同假设下，使用非参数（Lipschitz）比较类调查上下文在线学习。为了获得完整的信息反馈和Lipschitz损失，我们设计了第一个显式算法，以实现minimax后悔率（达到对数因子）。在以第二次拍卖为动机的部分反馈模型中，我们获得了Lipschitz和Semi-Lipschitz损失的算法，后悔界限在标准匪徒反馈的已知界限上得到了改善。我们的分析将基于上下文的第二价格拍卖的新颖结果与基于链接的新颖算法方法结合在一起。当上下文空间是欧几里得时，我们的链接方法是有效的，并提供了更好的后悔约束。

著录项

来源
《JMLR: Workshop and Conference Proceedings》 |2017年第6期|共17页
作者
Nicolò Cesa-Bianchi; Pierre Gaillard; Claudio Gentile; Sébastien Gerchinovitz;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类人工智能理论;
关键词

相似文献

外文文献
中文文献
专利

1. Algorithmic Chaining and the Role of Partial Feedback in Online Nonparametric Learning [J] . Nicolò Cesa-Bianchi, Pierre Gaillard, Claudio Gentile, JMLR: Workshop and Conference Proceedings . 2017,第1期

机译：算法链接和部分反馈在在线非参数学习中的作用
2. A Potential-based Framework for Online Multi-class Learning with Partial Feedback [J] . Hamed Valizadegan, Rong Jin, Shijun Wang JMLR: Workshop and Conference Proceedings . 2010,第2010期

机译：具有部分反馈的基于潜力的在线多班学习框架
3. A Potential-based Framework for Online Multi-class Learning with Partial Feedback [J] . Hamed Valizadegan, Rong Jin, Shijun Wang JMLR: Workshop and Conference Proceedings . 2010,第2010期

机译：具有部分反馈的基于潜力的在线多班学习框架
4. Evaluation of Online Assessment: The Role of Feedback in Learner-Centered e-Learning [C] . Noorminshah Iahad, Emmanouil kalaitzakis, Georgios A. Dafoulas, Annual Hawaii International Conference on System Sciences . 2004

机译：在线评估评估：反馈在学习者中心的电子学习中的作用
5. Online Learning and Decision Making with Partial Information, a Feedback Perspective [D] . Rangi, Anshuka. 2021

机译：使用部分信息的在线学习和决策，反馈视角
6. The anatomy of a distributed predictive modeling framework: online learning blockchain network and consensus algorithm [O] . Tsung-Ting Kuo 2020

机译：分布式预测建模框架的解剖学：在线学习区块链和共识算法
7. Generic Online Learning for Partial Visible Dynamic Environment with Delayed Feedback [O] . Behrooz Shahriari -1

机译：通用在线学习部分可见和动态环境，具有延迟反馈
8. Algorithms for Markov Decision Chains with Partial Information [R] . Loeve, J. A. 1993

机译：具有部分信息的马尔可夫决策链的算法

Algorithmic Chaining and the Role of Partial Feedback in Online Nonparametric Learning

摘要

著录项

相似文献

相关主题

期刊订阅