【24h】

Learning Similarity Function for Rare Queries

机译:学习稀有查询的相似功能

获取原文
获取原文并翻译 | 示例

摘要

The key element of many query processing tasks can be formalized as calculation of similarities between queries. These include query suggestion, query reformulation, and query expansion. Although many methods have been proposed for query similarity calculation, they could perform poorly on rare queries. As far as we know, there was no previous work particularly about rare query similarity calculation, and this paper tries to study this problem. Specifically, we address three problems. Firstly, we define an n-gram space to represent queries with their own content and a similarity function to measure; the similarities between queries. Secondly, we propose learning the similarity function by leveraging the training data derived from user behavior data. This is formalized as an optimization problem and a metric learning approach is employed to solve it, efficiently. Finally, we exploit locality sensitive hashing for efficient retrieval of similar queries from a large query repository. We experimentally verified the effectiveness of the proposed approach by showing that our method can indeed enhance the accuracy of query similarity calculation for rare queries and efficiently retrieve similar queries. As an application, we also experimentally demonstrated that the similar queries found by our method can significantly improve search relevance.
机译:许多查询处理任务的关键要素可以形式化为查询之间相似度的计算。这些包括查询建议,查询重新制定和查询扩展。尽管已经提出了许多方法来进行查询相似度计算,但是它们在稀有查询中的性能可能很差。据我们所知,以前没有关于稀有查询相似性计算的工作,本文试图研究这个问题。具体来说,我们解决了三个问题。首先,我们定义一个n元语法空间来表示具有自己内容的查询和一个要度量的相似性函数;查询之间的相似性。其次,我们建议利用来自用户行为数据的训练数据来学习相似性函数。这被形式化为优化问题,并采用度量学习方法来有效地解决它。最后,我们利用位置敏感的哈希算法从大型查询存储库中高效检索相似查询。我们通过证明该方法确实可以提高稀有查询的查询相似度计算的准确性并有效地检索相似查询,来实验验证了该方法的有效性。作为应用程序,我们还通过实验证明了通过我们的方法发现的类似查询可以显着提高搜索的相关性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号