New Retrieval Method Based on Relative Entropy for Language Modeling with Different Smoothing Methods

         

Abstract

A language model for information retrieval is built by using a query language model to generate queries and a document language model to generate documents. The documents are ranked according to the relative entropies of the estimated document language models with respect to the estimated query language model. Two popular and relatively efficient smoothing methods, the Jelinek-Mercer method and the absolute discounting method, are used to smooth the document language model during its estimation. A combined model composed of the feedback document language model and the collection language model is used to estimate the query model. A performance comparison is made between the new retrieval method and the existing method with feedback, and the retrieval performance of the proposed method with the two smoothing techniques is evaluated on three Text Retrieval Conference (TREC) data sets. Experimental results show that the method is effective and performs better than the basic language modeling approach; moreover, the method using the Jelinek-Mercer technique performs better than that using the absolute discounting technique, and the performance is sensitive to the smoothing parameters.
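The ranking scheme described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes the relative entropy is taken as D(query model || document model), uses the textbook forms of Jelinek-Mercer and absolute discounting smoothing, and takes the query model as a plain dictionary rather than the feedback-based combined model. All function names, the toy documents, and the parameter values (`lam`, `delta`) are hypothetical.

```python
import math
from collections import Counter


def jelinek_mercer(doc_counts, doc_len, coll_prob, lam=0.5):
    """Textbook JM smoothing: (1 - lam) * c(w,d)/|d| + lam * p(w|C)."""
    def prob(w):
        return (1 - lam) * doc_counts.get(w, 0) / doc_len + lam * coll_prob[w]
    return prob


def absolute_discount(doc_counts, doc_len, coll_prob, delta=0.7):
    """Textbook absolute discounting:
    max(c(w,d) - delta, 0)/|d| + delta * |d|_unique / |d| * p(w|C)."""
    unique = len(doc_counts)
    def prob(w):
        seen = max(doc_counts.get(w, 0) - delta, 0) / doc_len
        return seen + delta * unique / doc_len * coll_prob[w]
    return prob


def kl_divergence(query_model, doc_prob):
    """D(Q || D) over the query model's support (assumed direction)."""
    return sum(q * math.log(q / doc_prob(w))
               for w, q in query_model.items() if q > 0)


def rank(query_model, docs, smoother, **params):
    """Return document ids sorted by increasing divergence (best first).

    Assumes every query term occurs somewhere in the collection, so the
    smoothed document probabilities are never zero.
    """
    coll_counts = Counter(w for tokens in docs.values() for w in tokens)
    total = sum(coll_counts.values())
    coll_prob = {w: c / total for w, c in coll_counts.items()}
    scores = {}
    for doc_id, tokens in docs.items():
        doc_prob = smoother(Counter(tokens), len(tokens), coll_prob, **params)
        scores[doc_id] = kl_divergence(query_model, doc_prob)
    return sorted(scores, key=scores.get)


# Toy collection and query model, invented purely for illustration.
docs = {
    "d1": "language model retrieval smoothing".split(),
    "d2": "railway bridge engineering model".split(),
    "d3": "entropy retrieval language language".split(),
}
query_model = {"language": 0.5, "retrieval": 0.5}

jm_ranking = rank(query_model, docs, jelinek_mercer, lam=0.5)
ad_ranking = rank(query_model, docs, absolute_discount, delta=0.7)
print(jm_ranking, ad_ranking)
```

Varying `lam` or `delta` in this sketch changes the divergence scores, which gives a small-scale feel for the parameter sensitivity the abstract reports; the toy data says nothing about which smoothing method performs better on real collections.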

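Bibliographic Information

  • Source
    Railway Engineering Science (English Edition) | 2006, Issue 2 | pp. 113-120 | 8 pages
  • Authors

    Huo Hua; Liu Junqiang; Feng Boqin;

  • Affiliations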

    School of Electronics & Information Engineering, Henan University of Science & Technology, Luoyang 471003, China;

    Department of Computer Science, Xi'an Jiaotong University, Xi'an 710049, China;

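  • Original format: PDF
  • Text language: chi
  • CLC classification: Programming languages, algorithmic languages;
  • Keywords

    language model; retrieval method; average information content (entropy); refinement;

    Machine translation: information retrieval; relative entropy; language modeling; smoothing;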