【24h】

Defensive Universal Learning with Experts

机译:与专家进行防御性通用学习

获取原文
获取原文并翻译 | 示例

摘要

This paper shows how universal learning can be achieved with expert advice. To this aim, we specify an experts algorithm with the following characteristics: (a) it uses only feedback from the actions actually chosen (bandit setup), (b) it can be applied with countably infinite expert classes, and (c) it copes with losses that may grow in time appropriately slowly. We prove loss bounds against an adaptive adversary. Prom this, we obtain a master algorithm for "reactive" experts problems, which means that the master's actions may influence the behavior of the adversary. Our algorithm can significantly outperform standard experts algorithms on such problems. Finally, we combine it with a universal expert class. The resulting universal learner performs - in a certain sense - almost as well as any computable strategy, for any online decision problem. We also specify the (worst-case) convergence speed, which is very slow.
机译:本文展示了如何在专家建议下实现通用学习。为此,我们指定了一种具有以下特征的专家算法:(a)它仅使用实际选择的操作(强盗设置)的反馈;(b)它可以应用于无数的无限专家类别,并且(c)可以应对。损失可能会适时缓慢地增长。我们证明了适应性对手的损失界限。对此,我们获得了一个针对“反应性”专家问题的大师算法,这意味着大师的行为可能会影响对手的行为。在此类问题上,我们的算法可以大大优于标准专家算法。最后,我们将其与通用专家类相结合。在某种意义上,由此产生的通用学习器在任何在线决策问题上的表现几乎与任何可计算的策略一样好。我们还指定了(最坏情况)收敛速度,这是非常慢的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号