...
首页> 外文期刊>JMLR: Workshop and Conference Proceedings >Online Learning Using Only Peer Prediction
【24h】

Online Learning Using Only Peer Prediction

机译:仅使用对等预测的在线学习

获取原文
           

摘要

This paper considers a variant of the classical online learning problem with expert predictions. Our model’s differences and challenges are due to lacking any direct feedback on the loss each expert incurs at each time step $t$. We propose an approach that uses peer prediction and identify conditions where it succeeds. Our techniques revolve around a carefully designed peer score function $s()$ that scores experts’ predictions based on the peer consensus. We show a sufficient condition, that we call emph{peer calibration}, under which standard online learning algorithms using loss feedback computed by the carefully crafted $s()$ have bounded regret with respect to the unrevealed ground truth values. We then demonstrate how suitable $s()$ functions can be derived for different assumptions and models.
机译:本文考虑了具有专家预测的经典在线学习问题的变体。我们的模型的差异和挑战是由于缺乏关于损失的任何直接反馈,每个专家每次步骤$ t $。我们提出一种使用对等预测的方法,并识别成功的条件。我们的技术围绕着精心设计的同行评分函数$ s()$,以达到同行协商一致的专家的预测。我们展示了一种充分的条件,我们呼叫 emph {peer校准},在使用由仔细制作$ s()$计算的损耗反馈的标准在线学习算法,对未伪造的地面真相价值有界遗憾。然后,我们演示如何为不同的假设和模型导出S()$函数。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号