首页> 外文OA文献 >DCU-TCD@LogCLEF 2010: re-ranking document collections and query performance estimation
【2h】

DCU-TCD@LogCLEF 2010: re-ranking document collections and query performance estimation

机译:DCU-TCD @ LogCLEF 2010:重新排列文档集和查询性能估计

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

This paper describes the collaborative participation of Dublin City University and Trinity College Dublin in LogCLEF 2010. Two sets of experiments were conducted. First, different aspects of the TEL query logs were analysed after extracting user sessions of consecutive queries on a topic. The relation between the queries and their length (number of terms) and position (first query or further reformulations) was examined in a session with respect to query performance estimators such as queryudscope, IDF-based measures, simplified query clarity score, and average inverse document collection frequency. Results of this analysis suggest that only some estimator values show a correlation with query length or position in the TEL logs (e.g. similarity score between collection and query). Second, the relation between three attributes was investigated: the user's country (detected from IP address), the query language, and the interface language. The investigation aimed to explore the influence of the three attributes on the user's collection selection. Moreover, the investigation involved assigning different weights to the three attributes in a scoring function that was used to re-rank the collections displayed to the user according to the language and country. The results of theudcollection re-ranking show a significant improvement in Mean Average Precision (MAP) over the original collection ranking of TEL. The results also indicate that the query language and interface language have more inuduence than the user's country on the collections selected by the users.
机译:本文介绍了都柏林城市大学和都柏林三一学院在LogCLEF 2010上的合作参与。进行了两组实验。首先,在提取有关主题的连续查询的用户会话之后,对TEL查询日志的不同方面进行了分析。在会话中,针对查询性能估算器(例如query udscope,基于IDF的量度,简化的查询清晰度得分和),检查了查询及其长度(项数)和位置(首次查询或进一步的重新定义)之间的关系。平均逆文档收集频率。分析结果表明,只有一些估计值显示与TEL日志中查询长度或位置的相关性(例如,收集和查询之间的相似性得分)。其次,研究了三个属性之间的关系:用户所在的国家(从​​IP地址检测到),查询语言和界面语言。该调查旨在探讨这三个属性对用户的收藏集选择的影响。此外,调查涉及在评分功能中为三个属性分配不同的权重,该评分功能用于根据语言和国家/地区对显示给用户的馆藏进行重新排名。 udcollect的重新排序结果表明,平均平均精度(MAP)相对于TEL的原始集合排名有了显着提高。结果还表明,查询语言和界面语言对用户选择的集合的影响程度大于用户所在国家/地区。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号