首页> 外文会议>Advances in information retrieval >Balancing Exploration and Exploitation in Learning to Rank Online
【24h】

Balancing Exploration and Exploitation in Learning to Rank Online

机译:在线学习排名中平衡探索与开发

获取原文
获取原文并翻译 | 示例

摘要

As retrieval systems become more complex, learning to rank approaches are being developed to automatically tune their parameters. Using online learning to rank approaches, retrieval systems can learn directly from implicit feedback, while they are running. In such an online setting, algorithms need to both explore new solutions to obtain feedback for effective learning, and exploit what has already been learned to produce results that are acceptable to users. We formulate this challenge as an exploration-exploitation dilemma and present the first online learning to rank algorithm that works with implicit feedback and balances exploration and exploitation. We leverage existing learning to rank data sets and recently developed click models to evaluate the proposed algorithm. Our results show that finding a balance between exploration and exploitation can substantially improve online retrieval performance, bringing us one step closer to making online learning to rank work in practice.
机译:随着检索系统变得越来越复杂,正在开发学习排序方法来自动调整其参数。使用在线学习对方法进行排名,检索系统可以在运行时直接从隐式反馈中学习。在这样的在线环境中,算法既需要探索新的解决方案以获得有效学习的反馈,又需要利用已经学到的知识来产生用户可以接受的结果。我们将此挑战表述为探索与开发的困境,并提出了第一个在线学习排序算法,该算法可与隐式反馈配合使用并平衡探索与开发。我们利用现有的知识对数据集进行排名,并利用最近开发的点击模型来评估所提出的算法。我们的结果表明,在探索与开发之间找到平衡可以大大提高在线检索性能,使我们更接近于使在线学习对工作进行排名。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号