【24h】

On-line Learning with Delayed Label Feedback

机译:延迟标签反馈的在线学习

获取原文
获取原文并翻译 | 示例

摘要

We generalize on-line learning to handle delays in receiving labels for instances. After receiving an instance x, the algorithm may need to make predictions on several new instances before the label for x is returned by the environment. We give two simple techniques for converting a traditional on-line algorithm into an algorithm for solving a delayed on-line problem. One technique is for instances generated by an adversary; the other is for instances generated by a distribution. We show how these techniques effect the original on-line mistake bounds by giving upper-bounds and restricted lower-bounds on the number of mistakes.
机译:我们对在线学习进行一般化处理,以处理实例接收标签的延迟。在接收到实例x之后,该算法可能需要在环境返回返回x的标签之前对几个新实例进行预测。我们提供了两种简单的技术,可以将传统的在线算法转换为解决延迟的在线问题的算法。一种技术是针对对手产生的实例。另一个是由分发生成的实例。我们通过给出错误数量的上限和下限来说明这些技术如何影响原始的在线错误范围。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号