首页> 外文会议>Advances in focused retrieval >Overview of the INEX 2008 Ad Hoc Track
【24h】

Overview of the INEX 2008 Ad Hoc Track

机译:INEX 2008专项跟踪概述

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

This paper gives an overview of the INEX 2008 Ad Hoc Track. The main goals of the Ad Hoc Track were two-fold. The first goal was to investigate the value of the internal document structure (as provided by the XML mark-up) for retrieving relevant information. This is a continuation of INEX 2007 and, for this reason, the retrieval results are liberalized to arbitrary passages and measures were chosen to fairly compare systems retrieving elements, ranges of elements, and arbitrary passages. The second goal was to compare focused retrieval to article retrieval more directly than in earlier years. For this reason, standard document retrieval rankings have been derived from all runs, and evaluated with standard measures. In addition, a set of queries targeting Wikipedia have been derived from a proxy log, and the runs are also evaluated against the clicked Wikipedia pages. The INEX 2008 Ad Hoc Track featured three tasks: For the Focused Task a ranked-list of non-overlapping results (elements or passages) was needed. For the Relevant in Context Task non-overlapping results (elements or passages) were returned grouped by the article from which they came. For the Best in Context Task a single starting point (element start tag or passage start) for each article was needed. We discuss the results for the three tasks, and examine the relative effectiveness of element and passage retrieval. This is examined in the context of content only (CO, or Keyword) search as well as content and structure (CAS, or structured) search. Finally, we look at the ability of focused retrieval techniques to rank articles, using standard document retrieval techniques, both against the judged topics as well as against queries and clicks from a proxy log.
机译:本文概述了INEX 2008临时跟踪。临时跟踪的主要目标有两个。第一个目标是调查内部文档结构(由XML标记提供)的价值,以检索相关信息。这是INEX 2007的延续,因此,将检索结果放宽为任意段落,并选择了度量以公平地比较检索元素,元素范围和任意段落的系统。第二个目标是比往年更直接地将重点检索与文章检索进行比较。因此,已经从所有运行中得出了标准的文档检索排名,并使用标准方法进行了评估。此外,还从代理日志中获取了一组针对Wikipedia的查询,并且还针对单击的Wikipedia页面评估了运行。 INEX 2008特设跟踪具有三个任务:对于重点任务,需要一个不重叠结果(元素或段落)的排名列表。对于“上下文任务相关”,返回的非重叠结果(元素或段落)按它们来自的文章分组。对于最佳上下文任务,每个文章都需要一个起点(元素起点标签或段落起点)。我们讨论了这三个任务的结果,并检查了元素和段落检索的相对有效性。仅在内容(CO或关键字)搜索以及内容和结构(CAS或结构化)搜索的上下文中对此进行检查。最后,我们将研究针对标准主题检索技术,针对已判定主题以及针对代理日志中的查询和点击的集中检索技术对文章进行排名的能力。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号