首页> 外文期刊>JMLR: Workshop and Conference Proceedings >Algorithmic Chaining and the Role of Partial Feedback in Online Nonparametric Learning
【24h】

Algorithmic Chaining and the Role of Partial Feedback in Online Nonparametric Learning

机译:算法链接和部分反馈在在线非参数学习中的作用

获取原文
           

摘要

We investigate contextual online learning with nonparametric (Lipschitz) comparison classes under different assumptions on losses and feedback information. For full information feedback and Lipschitz losses, we design the first explicit algorithm achieving the minimax regret rate (up to log factors). In a partial feedback model motivated by second-price auctions, we obtain algorithms for Lipschitz and semi-Lipschitz losses with regret bounds improving on the known bounds for standard bandit feedback. Our analysis combines novel results for contextual second-price auctions with a novel algorithmic approach based on chaining. When the context space is Euclidean, our chaining approach is efficient and delivers an even better regret bound.
机译:我们在损失和反馈信息的不同假设下,使用非参数(Lipschitz)比较类调查上下文在线学习。为了获得完整的信息反馈和Lipschitz损失,我们设计了第一个显式算法,以实现minimax后悔率(达到对数因子)。在以第二次拍卖为动机的部分反馈模型中,我们获得了Lipschitz和Semi-Lipschitz损失的算法,后悔界限在标准匪徒反馈的已知界限上得到了改善。我们的分析将基于上下文的第二价格拍卖的新颖结果与基于链接的新颖算法方法结合在一起。当上下文空间是欧几里得时,我们的链接方法是有效的,并提供了更好的后悔约束。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号