首页> 外文期刊>International journal of information system modeling and design >On The Reuse of Past Searches in Information Retrieval: Study of Two Probabilistic Algorithms
【24h】

On The Reuse of Past Searches in Information Retrieval: Study of Two Probabilistic Algorithms

机译:信息检索中过去搜索的重用:两种概率算法的研究

获取原文
获取原文并翻译 | 示例
           

摘要

When using information retrieval systems, information related to searches is typically stored in files, which are well known as log files. By contrast, past search results of previously submitted queries are ignored most of the time. Nevertheless, past search results can be profitable for new searches. Some approaches in Information Retrieval exploit the previous searches in a customizable way for a single user. On the contrary, approaches that deal with past searches collectively are less common. This paper deals with such an approach, by using past results of similar past queries submitted by other users, to build the answers for new submitted queries. It proposes two Monte Carlo algorithms to build the result for a new query by selecting relevant documents associated to the most similar past query. Experiments were carried out to evaluate the effectiveness of the proposed algorithms using several dataset variants. These algorithms were also compared with the baseline approach based on the cosine measure, from which they reuse past results. Simulated datasets were designed for the experiments, following the Cranfield paradigm, well established in the Information Retrieval domain. The empirical results show the interest of our approach.
机译:使用信息检索系统时,与搜索有关的信息通常存储在文件中,这些文件称为日志文件。相比之下,以前提交的查询的过去搜索结果通常会被忽略。尽管如此,过去的搜索结果对于新搜索还是有利可图的。信息检索中的某些方法以可定制的方式为单个用户利用以前的搜索。相反,共同处理过去搜索的方法不太普遍。本文通过使用其他用户提交的类似过去​​查询的过去结果来处理这种方法,从而为新提交的查询建立答案。它提出了两种蒙特卡洛算法,以通过选择与最相似的过去查询相关的相关文档来构建新查询的结果。使用几种数据集变体进行了实验,以评估所提出算法的有效性。这些算法也与基于余弦测度的基线方法进行了比较,它们从中重新使用了过去的结果。按照Cranfield范式为实验设计了模拟数据集,该范式已在信息检索领域中确立。实证结果表明我们的方法很有趣。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号