...
首页> 外文期刊>Malaysian Journal of Computer Science >Utilization Of Cross-Terms To Enhance The Language Model For Information Retrieval
【24h】

Utilization Of Cross-Terms To Enhance The Language Model For Information Retrieval

机译:利用跨术语增强信息检索的语言模型

获取原文
   

获取外文期刊封面封底 >>

       

摘要

Traditional retrieval models were effective in the early stage of the Web; however, with the huge amount of information that is available on the Web today further optimization is required to enhance the performance of these models in extracting the most relevant information. Utilization of the term proximity is one of the techniques that have been introduced for this purpose by many researchers. It assumes that the words in the user query are correlated and thus proximity between them should be considered in the matching process. Density-based proximity is an effectual type of term proximity measures which is still not fully considered in the retrieval models. In this paper we investigate the application of a recent density-based measure called Cross-Terms which has achieved significant scores when applied on the effective BM25 retrieval model. We applied cross-terms on another effective retrieval model that is the Language Modeling Approach. The performance of the enhanced language model was measured and evaluated through several experiments and metrics. Experiments results show that the cross-terms measure was able to improve the performance of the basic language model in all the applied evaluation metrics. Performance improvement reached (+4%) with the MAP metric and (+8%) with P@5 and P@20 metrics.
机译:传统的检索模型在Web的早期阶段很有效。但是,由于当今Web上提供了大量信息,因此需要进一步优化以增强这些模型在提取最相关信息时的性能。术语“接近”的使用是许多研究者为此目的引入的技术之一。假定用户查询中的单词是相关的,因此在匹配过程中应考虑它们之间的接近度。基于密度的接近度是术语接近度度量的有效类型,在检索模型中仍未完全考虑。在本文中,我们研究了一种最新的基于密度的测度(称为交叉项)的应用,该测度在有效的BM25检索模型上获得了明显的成绩。我们在另一个有效的检索模型语言模型方法中应用了交叉术语。通过一些实验和指标来测量和评估增强的语言模型的性能。实验结果表明,在所有应用的评估指标中,跨术语量测均能够提高基本语言模型的性能。使用MAP指标的性能提高达到(+ 4%),使用P @ 5和P @ 20指标的性能提高达到(+ 8%)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号