首页> 外文会议>Comparative evaluation of focused retrieval >Extended Language Models for XML Element Retrieval
【24h】

Extended Language Models for XML Element Retrieval

机译:XML元素检索的扩展语言模型

获取原文
获取原文并翻译 | 示例

摘要

In this paper we describe our participation in the INEX 2010 ad-hoc track. We participated in three retrieval tasks (restricted focused task, relevant-in-context, restricted relevant-in-context) and report our findings based on a single set of measure for all tasks. In this year's participation, we evaluate the performance of the standard language model that is more focused on a fixed number of relevant characters than on relevant paragraphs. Our findings are: 1) the simplest language model for document retrieval performs relatively well in the restricted focused task when using a fixed offset that is close to the average character distance from the beginning of a document to its main content; 2) a good result of document ranking does improve the performance of snippet retrieval; 3) stemming and stopword removal can further boost performance.
机译:在本文中,我们描述了我们对INEX 2010临时跟踪的参与。我们参与了三个检索任务(受限的重点任务,上下文相关的,受限的上下文相关的),并基于针对所有任务的一套度量来报告我们的发现。在今年的参与中,我们评估了标准语言模型的性能,该模型更加关注固定数量的相关字符而不是相关段落。我们的发现是:1)当使用固定偏移量时,最简单的文档检索语言模型在受限的重点任务中表现相对较好,该偏移量接近从文档开始到其主要内容的平均字符距离; 2)文档排名的良好结果确实提高了代码段检索的性能; 3)移除词干和停用词可以进一步提高性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号