首页> 外文OA文献 >Preference relations based unsupervised rank aggregation for metasearch
【2h】

Preference relations based unsupervised rank aggregation for metasearch

机译:基于偏好关系的无监督秩聚合元搜索

摘要

Rank aggregation mechanisms have been used in solving problems from various domains such as bioinformatics, natural language processing, information retrieval, etc. Metasearch is one such application where a user gives a query to the metasearch engine, and the metasearch engine forwards the query to multiple individual search engines. Results or rankings returned by these individual search engines are combined using rank aggregation algorithms to produce the final result to be displayed to the user. We identify few aspects that should be kept in mind for designing any rank aggregation algorithms for metasearch. For example, generally equal importance is given to the input rankings while performing the aggregation. However, depending on the indexed set of web pages, features considered for ranking, ranking functions used etc. by the individual search engines, the individual rankings may be of different qualities. So, the aggregation algorithm should give more weight to the better rankings while giving less weight to others. Also, since the aggregation is performed when the user is waiting for response, the operations performed in the algorithm need to be light weight. Moreover, getting supervised data for rank aggregation problem is often difficult. In this paper, we present an unsupervised rank aggregation algorithm that is suitable for metasearch and addresses the aspects mentioned above.ududWe also perform detailed experimental evaluation of the proposed algorithm on four different benchmark datasets having ground truth information. Apart from the unsupervised Kendall-Tau distance measure, several supervised evaluation measures are used for performance comparison. Experimental results demonstrate the efficacy of the proposed algorithm over baseline methods in terms of supervised evaluation metrics. Through these experiments we also show that Kendall-Tau distance metric may not be suitable for evaluating rank aggregation algorithms for metasearch.
机译:排名聚合机制已用于解决来自各个领域的问题,例如生物信息学,自然语言处理,信息检索等。元搜索是这样一种应用程序,用户向元搜索引擎提供查询,而元搜索引擎将查询转发给多个单个搜索引擎。这些个人搜索引擎返回的结果或排名使用排名汇总算法进行组合,以生成最终结果显示给用户。我们发现在设计用于元搜索的任何秩聚合算法时应牢记的几个方面。例如,在执行聚合时,通常给予输入排名同等的重要性。但是,根据网页的索引集,考虑用于排名的功能,各个搜索引擎使用的排名功能等,各个排名可能具有不同的质量。因此,聚合算法应给予更好的排名以更大的权重,而给予其他人较少的权重。另外,由于聚合是在用户等待响应时执行的,因此算法中执行的操作必须轻巧。此外,获取用于等级汇总问题的受监督数据通常很困难。本文中,我们提出了一种适用于元搜索并解决了上述问题的无监督秩聚合算法。 ud ud我们还对具有地面真实性信息的四个不同基准数据集进行了对该算法的详细实验评估。除了无监督的Kendall-Tau距离度量之外,还使用了几种有监督的评估度量来进行性能比较。实验结果表明,在监督评估指标方面,该算法优于基线方法的有效性。通过这些实验,我们还表明,Kendall-Tau距离度量标准可能不适合评估元搜索的秩聚合算法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号