首页> 外文会议>Comparative evaluation of focused retrieval >Focus and Element Length for Book and Wikipedia Retrieval
【24h】

Focus and Element Length for Book and Wikipedia Retrieval

机译:书籍和维基百科检索的焦点和元素长度

获取原文
获取原文并翻译 | 示例

摘要

In this paper we describe our participation in INEX 2010 in the Ad Hoc Track and the Book Track. In the Ad Hoc track we investigate the impact of propagated anchor-text on article level precision and the impact of an element length prior on the within-document precision and recall. Using the article ranking of an document level run for both document and focused retrieval techniques, we find that focused retrieval techniques clearly outperform document retrieval, especially for the Focused and Restricted Relevant in Context Tasks, which limit the amount of text than can be returned per topic and per article respectively. Somewhat surprisingly, an element length prior increases within-document precision even when we restrict the amount of retrieved text to only 1000 characters per topic. The query-independent evidence of the length prior can help locate elements with a large fraction of relevant text. For the Book Track we look at the relative impact of retrieval units based on whole books, individual pages and multiple pages.
机译:在本文中,我们将在特别跟踪和图书跟踪中描述我们对INEX 2010的参与。在专案跟踪中,我们调查传播的锚文本对文章级别精度的影响,以及元素长度的提前对文档内精度和召回率的影响。使用针对文档检索和集中检索技术的文档级别的文章排名,我们发现集中检索技术明显胜过文档检索,尤其是上下文任务中的“集中和受限相关”,这限制了每次返回的文本量主题和每篇文章。出乎意料的是,即使我们将每个主题的检索文本量限制为仅1000个字符,元素长度优先级也会提高文档内的精度。长度先验的独立于查询的证据可以帮助定位具有大部分相关文本的元素。对于“书本跟踪”,我们基于整个书本,单个页面和多个页面来查看检索单元的相对影响。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号