首页> 外文会议>ACM conference on information and knowledge management >A Late Fusion Approach to Cross-lingual Document Re-ranking
【24h】

A Late Fusion Approach to Cross-lingual Document Re-ranking

机译:交叉语言文件重新排名的后期融合方法

获取原文

摘要

The field of information retrieval still strives to develop models which allow semantic information to be integrated in the ranking process to improve performance in comparison to standard bag-of-words based models. Cross-lingual information retrieval is an example of where such a model is required, as content or concepts often need to be matched across languages. To overcome this problem, a conceptual model has been adopted in ranking an entire corpus which normally exploits latent/implicit features of the text. One of the drawbacks of this model is that the computational cost is significant and often intractable in modern test collections. Therefore, approaches utilizing concept-based models for re-ranking initial retrieval results have attracted a considerable amount of study, in particular the latent concept model. However, fitting such a model to a smaller collection is less meaningful than fitting it into the whole corpus. This paper proposes a late fusion method which incorporates scores generated by using external knowledge to enhance the space produced by the latent concept method. This method is further demonstrated to be suitable for multilingual re-ranking purposes. To illustrate the effectiveness of the proposed method, experiments were conducted over test collections across three languages. The results demonstrate that the method can comfortably achieve improvements in retrieval performance over several re-ranking methods.
机译:信息检索领域仍然致力于开发模式,允许被集成在排名过程中的语义信息,相较于标准袋的词基于模型以提高性能。跨语言信息检索是在需要这样的模型的一个例子,作为内容或经常需要跨语言被匹配概念。为了克服这个问题,一个概念模型已排名通常利用了文本的潜/隐式的特征的整个语料库通过。一个这种模式的缺点是计算成本是现代测试集合显著,往往难以解决。因此,使用方法为重排序初始检索结果基于概念的模型已经吸引了相当数量的研究,特别是潜在的概念模型。然而,到一个较小的集合拟合这种模型比它嵌入到整个阴茎意义不大。本文提出了一种包含通过使用外部知识来增强由潜概念方法制造的空间中产生的分数后期融合方法。此方法进一步证明是适用于多语言重新排序的目的。为了说明该方法的有效性,实验跨三种语言进行了测试集合。结果表明,该方法可以舒适地实现检索性能的改进在几个重排序方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号