首页> 外文会议>International conference on knowledge science, engineering and management >Online Kernel Selection with Multiple Bandit Feedbacks in Random Feature Space
【24h】

Online Kernel Selection with Multiple Bandit Feedbacks in Random Feature Space

机译:随机特征空间中具有多个Bandit反馈的在线内核选择

获取原文

摘要

Online kernel selection is critical to online kernel learning, and must address the exploration-exploitation dilemma, where we explore new kernels to find the best one and exploit the kernel that showed the best performance in the past. In this paper, we propose a novel multi-armed bandit solution to the exploration-exploitation dilemma in online kernel selection. We first correspond each candidate kernel to an arm of a multi-armed bandit problem. Different from typical multi-armed bandit models where only one kernel is selected at each round, we sample multiple kernels with replacement according to a probability distribution. Then, we make prediction with the hypotheses learned in the random feature spaces specified by the selected kernels, and incur multiple losses referred to as multiple bandit feedbacks. Finally, we use all the feedbacks to update the probability distribution. We prove that the proposed approach enjoys a sub-linear expected regret bound. Experimental results on benchmark datasets show that the proposed approach has a comparable performance with existing online kernel selection methods.
机译:在线内核选择对于在线内核学习至关重要,并且必须解决探索开发难题,我们在其中探索新内核以找到最佳内核,并利用过去表现出最佳性能的内核。在本文中,我们提出了一种新颖的多臂强盗解决方案,以解决在线内核选择中的勘探开发难题。我们首先将每个候选内核对应到多臂强盗问题的一个分支。与典型的多臂土匪模型不同,在这种模型中,每一轮只选择一个内核,我们根据概率分布对多个内核进行替换采样。然后,我们使用在选定内核指定的随机特征空间中学习到的假设进行预测,并产生称为多个强盗反馈的多个损失。最后,我们使用所有反馈来更新概率分布。我们证明了所提出的方法具有次线性期望后悔界限。在基准数据集上的实验结果表明,该方法具有与现有在线内核选择方法相当的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号