...
首页> 外文期刊>Expert Systems with Application >CF-Rank: Learning to rank by classifier fusion on click-through data
【24h】

CF-Rank: Learning to rank by classifier fusion on click-through data

机译:CF-排名:通过点击数据对分类器融合进行学习排名

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Ranking as a key functionality of Web search engines, is a user-centric process. However, click-through data, which is the source of implicit feedback of users, are not included in almost all of datasets published for the task of ranking. This limitation is also observable in the majority of benchmark datasets prepared for the learning to rank which is a new and promising trend in the information retrieval literature. In this paper, inspiring from the click-through data concept, the notion of click-through features is introduced. Click-through features could be derived from the given primitive dataset even in the absence of click-through data in the utilized benchmark dataset These features are categorized into three different categories and are either related to the users' queries, results of searches or clicks of users. With the use of click-through features, in this research, a novel learning to rank algorithm is proposed. By taking into account informativeness measures such as MAP, NDCG, InformationGain and OneR, at its first step, the proposed algorithm generates a classifier for each category of click-through features. Thereafter, these classifiers are fused together by using exponential ordered weighted averaging operators. Experimental results obtained from a plenty of investigations on WCL2R and LETOR4.0 benchmark datasets, demonstrate that the proposed method can substantially outperform well-known ranking methods in the presence of explicit click-through data based on MAP and NDCG criteria. Specifically, such an improvement is more noticeable on the top of ranked lists, which usually attract users' attentions more than other parts of these lists. This betterment on WCL2R dataset is about 20.25% for P@1 and 5.68% for P@3 in comparison with SVMRank, which is a well-known learning to rank algorithm. CF-Rank can also obtain higher or comparable performance with baseline methods even in the absence of explicit click-through data in utilized primitive datasets. In this regard, the proposed method on the LETOR4.0 dataset has achieved an improvement of about 2.7% on MAP measure compared to AdaRank-NDCG algorithm. (C) 2015 Elsevier Ltd. All rights reserved.
机译:排名是Web搜索引擎的关键功能,是一个以用户为中心的过程。但是,点击数据是用户隐式反馈的来源,几乎没有包含在为排名任务发布的所有数据集中。在为学习排名而准备的大多数基准数据集中,这种局限性也是可以观察到的,这是信息检索文献中的一个新的有希望的趋势。在本文中,从点击数据概念的启发出发,介绍了点击功能的概念。即使在所利用的基准数据集中不存在点击数据,也可以从给定的原始数据集中获得点击特征。这些特征被分为三类,它们与用户的查询,搜索结果或点击次数有关。用户。利用点击功能,本研究提出了一种新颖的学习排序算法。通过考虑诸如MAP,NDCG,InformationGain和OneR之类的信息量度措施,在提出的第一步中,所提出的算法会为每个点击特征类别生成分类器。此后,通过使用指数有序加权平均算子将这些分类器融合在一起。从对WCL2R和LETOR4.0基准数据集的大量研究中获得的实验结果表明,在存在基于MAP和NDCG标准的明确点击数据的情况下,该方法可以大大胜过众所周知的排名方法。具体来说,这种改进在排名最高的列表上更为明显,通常比这些列表的其他部分更能吸引用户的注意力。与SVMRank相比,WCL2R数据集的P @ 1改进约20.25%,P @ 3改进约5.68%,SVMRank是众所周知的排序算法。即使在所利用的原始数据集中没有明确的点击数据,CF-Rank也可以使用基准方法获得更高或相当的性能。在这方面,与AdaRank-NDCG算法相比,在LETOR4.0数据集上提出的方法在MAP度量上实现了约2.7%的改进。 (C)2015 Elsevier Ltd.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号