首页> 外文期刊>Journal of Information Science >Word-embedding-based pseudo-relevance feedback for Arabic information retrieval
【24h】

Word-embedding-based pseudo-relevance feedback for Arabic information retrieval

机译:基于词嵌入的伪相关反馈用于阿拉伯语信息检索

获取原文
获取原文并翻译 | 示例
       

摘要

Pseudo-relevance feedback (PRF) is a very effective query expansion approach, which reformulates queries by selecting expansion terms from top k pseudo-relevant documents. Although standard PRF models have been proven effective to deal with vocabulary mismatch between users' queries and relevant documents, expansion terms are selected without considering their similarity to the original query terms. In this article, we propose a method to incorporate word embedding (WE) similarity into PRF models for Arabic information retrieval (IR). The main idea is to select expansion terms using their distribution in the set of top pseudo-relevant documents along with their similarity to the original query terms. Experiments are conducted on the standard Arabic TREC 2001/2002 collection using three neural WE models. The obtained results show that our PRF extensions significantly outperform their baseline PRF models. Moreover, they enhanced the baseline IR model by 22% and 68% for the mean average precision (MAP) and the robustness index (RI), respectively.
机译:伪相关反馈(PRF)是一种非常有效的查询扩展方法,该方法通过从前k个伪相关文档中选择扩展项来重新构造查询。尽管已经证明标准PRF模型可以有效地解决用户查询和相关文档之间的词汇不匹配问题,但是在选择扩展术语时并未考虑它们与原始查询术语的相似性。在本文中,我们提出了一种将单词嵌入(WE)相似度纳入PRF模型以进行阿拉伯语信息检索(IR)的方法。主要思想是使用扩展术语在顶级伪相关文档集中的分布以及与原始查询术语的相似性来选择扩展术语。使用三个神经网络模型在标准的阿拉伯文TREC 2001/2002集合上进行了实验。获得的结果表明,我们的PRF扩展明显优于其基线PRF模型。此外,他们分别将平均平均精度(MAP)和稳健性指数(RI)的基线IR模型提高了22%和68%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号