首页> 外文会议>International workshop of the initiative for the evaluation of XML retrieval >Focus and Element Length for Book and Wikipedia Retrieval
【24h】

Focus and Element Length for Book and Wikipedia Retrieval

机译:焦点和元素长度为书籍和维基百科检索

获取原文

摘要

In this paper we describe our participation in INEX 2010 in the Ad Hoc Track and the Book Track. In the Ad Hoc track we investigate the impact of propagated anchor-text on article level precision and the impact of an element length prior on the within-document precision and recall. Using the article ranking of an document level run for both document and focused retrieval techniques, we find that focused retrieval techniques clearly outperform document retrieval, especially for the Focused and Restricted Relevant in Context Tasks, which limit the amount of text than can be returned per topic and per article respectively. Somewhat surprisingly, an element length prior increases within-document precision even when we restrict the amount of retrieved text to only 1000 characters per topic. The query-independent evidence of the length prior can help locate elements with a large fraction of relevant text. For the Book Track we look at the relative impact of retrieval units based on whole books, individual pages and multiple pages.
机译:在本文中,我们描述了我们在Ad Hoc轨道和书籍赛道中参与Inex 2010。在Ad Hoc轨道中,我们调查传播的锚文文本对文献精度的影响以及在文档内的精度和召回内部的元素长度的影响。使用文章排名为文档和聚焦检索技术运行的文章等级,我们发现聚焦检索技术显然优先于文档检索,特别是对于在上下文任务中的聚焦和限制相关,这限制了比每次可以返回的文本量主题和每篇文章。有些令人惊讶的是,即使在每个主题只限制检索到的文本到1000个字符的情况下,元素长度也会提高文档精度。关于长度的查询无关证据可以帮助找到具有大量相关文本的元素。对于书道,我们根据整个书籍,单个页面和多页面了解检索单位的相对影响。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号