首页> 外文会议>CIKM 10;ACM conference on information and knowledge management >A Late Fusion Approach to Cross-lingual Document Re-ranking
【24h】

A Late Fusion Approach to Cross-lingual Document Re-ranking

机译:跨语言文档重新排序的后期融合方法

获取原文

摘要

The field of information retrieval still strives to develop models which allow semantic information to be integrated in the ranking process to improve performance in comparison to standard bag-of-words based models. Cross-lingual information retrieval is an example of where such a model is required, as content or concepts often need to be matched across languages. To overcome this problem, a conceptual model has been adopted in ranking an entire corpus which normally exploits latent/implicit features of the text. One of the drawbacks of this model is that the computational cost is significant and often intractable in modern test collections. Therefore, approaches utilizing concept-based models for re-ranking initial retrieval results have attracted a considerable amount of study, in particular the latent concept model. However, fitting such a model to a smaller collection is less meaningful than fitting it into the whole corpus. This paper proposes a late fusion method which incorporates scores generated by using external knowledge to enhance the space produced by the latent concept method. This method is further demonstrated to be suitable for multilingual re-ranking purposes. To illustrate the effectiveness of the proposed method, experiments were conducted over test collections across three languages. The results demonstrate that the method can comfortably achieve improvements in retrieval performance over several re-ranking methods.
机译:信息检索领域仍在努力开发模型,该模型允许将语义信息集成到排序过程中,以与基于标准词袋的模型相比提高性能。跨语言信息检索是其中需要这种模型的示例,因为内容或概念通常需要跨语言进行匹配。为了克服这个问题,已经采用了一种概念模型来对整个语料库进行排名,该语料库通常利用文本的潜在/隐式特征。该模型的缺点之一是计算成本很高,并且在现代测试集合中通常是棘手的。因此,利用基于概念的模型对初始检索结果进行重新排序的方法吸引了相当多的研究,尤其是潜在的概念模型。但是,将这种模型适合较小的集合比将其适合整个语料库意义不大。本文提出了一种后期融合方法,该方法融合了使用外部知识生成的分数来增强潜在概念方法所产生的空间。进一步证明了该方法适用于多语言重新排序的目的。为了说明所提出方法的有效性,在三种语言的测试集合上进行了实验。结果表明,与几种重新排序方法相比,该方法可以舒适地实现检索性能的提高。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号