...
首页> 外文期刊>Information Processing & Management >Improved sentence retrieval using local context and sentence length
【24h】

Improved sentence retrieval using local context and sentence length

机译:使用局部上下文和句子长度改进句子检索

获取原文
获取原文并翻译 | 示例
           

摘要

In this paper we propose improved variants of the sentence retrieval method TF-ISF (a TF-IDF or Term Frequency-Inverse Document Frequency variant for sentence retrieval). The improvement is achieved by using context consisting of neighboring sentences and at the same time promoting the retrieval of longer sentences. We thoroughly compare new modified TF-ISF methods to the TF-ISF baseline, to an earlier attempt to include context into TF-ISF named tfmix and to a language modeling based method that uses context and promoting retrieval of long sentences named 3MMPDS. Experimental results show that the TF-ISF method can be improved using local context. Results also show that the TF-ISF method can be improved by promoting the retrieval of longer sentences. Finally we show that the best results are achieved when combining both modifications. All new methods (TF-ISF variants) also show statistically significant better results than the other tested methods.
机译:在本文中,我们提出了句子检索方法TF-ISF的改进变体(用于句子检索的TF-IDF或词频逆文档频率变体)。通过使用包含相邻句子的上下文并同时促进对较长句子的检索,可以实现这种改进。我们将新的修改后的TF-ISF方法与TF-ISF基线进行了彻底比较,更早地尝试将上下文包含到名为tfmix的TF-ISF中,并与使用上下文并促进检索名为3MMPDS的长句子的基于语言建模的方法进行了比较。实验结果表明,使用局部上下文可以改进TF-ISF方法。结果还表明,可以通过促进较长句子的检索来改进TF-ISF方法。最后,我们证明了将两种修改方式组合在一起可获得最佳结果。与其他测试方法相比,所有新方法(TF-ISF变体)也显示出统计上显着更好的结果。

著录项

  • 来源
    《Information Processing & Management》 |2013年第6期|1301-1312|共12页
  • 作者单位

    JP Croatian Telecommunications d.o.o. Mostar, Kralja Tvrtka 78, Mostar, Bosnia and Herzegovina;

    Faculty of Electrical Engineering, Mechanical Engineering and Naval Architecture, University of Split, R. Boskovica 32, Split, Croatia;

    Faculty of Electrical Engineering, Mechanical Engineering and Naval Architecture, University of Split, R. Boskovica 32, Split, Croatia;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Sentence retrieval; TF-ISF; Context; Sentence length;

    机译:句子检索;TF-ISF;上下文;句子长度;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号