Learning Similarity Function for Rare Queries

机译：学习稀有查询的相似功能

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

The key element of many query processing tasks can be formalized as calculation of similarities between queries. These include query suggestion, query reformulation, and query expansion. Although many methods have been proposed for query similarity calculation, they could perform poorly on rare queries. As far as we know, there was no previous work particularly about rare query similarity calculation, and this paper tries to study this problem. Specifically, we address three problems. Firstly, we define an n-gram space to represent queries with their own content and a similarity function to measure; the similarities between queries. Secondly, we propose learning the similarity function by leveraging the training data derived from user behavior data. This is formalized as an optimization problem and a metric learning approach is employed to solve it, efficiently. Finally, we exploit locality sensitive hashing for efficient retrieval of similar queries from a large query repository. We experimentally verified the effectiveness of the proposed approach by showing that our method can indeed enhance the accuracy of query similarity calculation for rare queries and efficiently retrieve similar queries. As an application, we also experimentally demonstrated that the similar queries found by our method can significantly improve search relevance.

机译：许多查询处理任务的关键要素可以形式化为查询之间相似度的计算。这些包括查询建议，查询重新制定和查询扩展。尽管已经提出了许多方法来进行查询相似度计算，但是它们在稀有查询中的性能可能很差。据我们所知，以前没有关于稀有查询相似性计算的工作，本文试图研究这个问题。具体来说，我们解决了三个问题。首先，我们定义一个n元语法空间来表示具有自己内容的查询和一个要度量的相似性函数；查询之间的相似性。其次，我们建议利用来自用户行为数据的训练数据来学习相似性函数。这被形式化为优化问题，并采用度量学习方法来有效地解决它。最后，我们利用位置敏感的哈希算法从大型查询存储库中高效检索相似查询。我们通过证明该方法确实可以提高稀有查询的查询相似度计算的准确性并有效地检索相似查询，来实验验证了该方法的有效性。作为应用程序，我们还通过实验证明了通过我们的方法发现的类似查询可以显着提高搜索的相关性。

著录项

来源
《Proceedings of the 4th ACM international conference on web search and data mining.》|2011年|p.615-624|共10页
会议地点 Hong Kong(HK);Hong Kong(HK)
作者
Jingfang Xu; Gu Xu;
展开▼
作者单位

Microsoft Research Asia 4/F, Sigma Building, No.49, Zhichun Road Haidian District, Beijing (100080), China;

Microsoft Research Asia 4/F, Sigma Building, No.49, Zhichun Road Haidian District, Beijing (100080), China;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类计算机网络;计算机网络;
关键词
query similarity; rare query; learning similarity function;

机译：查询相似度；罕见查询；学习相似度函数;
入库时间 2022-08-26 14:01:17

相似文献

外文文献
中文文献
专利

1. Audio Query by Example Using Similarity Measures between Probability Density Functions of Features [J] . Marko Helen, Thomas Virtanen EURASIP journal on audio, speech, and music processing . 2010,第4期

机译：使用特征的概率密度函数之间的相似性度量通过示例进行音频查询
2. Audio Query by Example Using Similarity Measures between Probability Density Functions of Features [J] . Marko Helén, Tuomas Virtanen EURASIP journal on audio, speech, and music processing . 2009,第1期

机译：使用特征的概率密度函数之间的相似性度量通过示例进行音频查询
3. Automated query classification based web service similarity technique using machine learning [J] . Balaji B. Saravana, Balakrishnan S., Venkatachalam K., Journal of ambient intelligence and humanized computing . 2021,第6期

机译：基于自动查询分类的基于Web服务相似性技术使用机器学习
4. Learning Similarity Function for Rare Queries [C] . Jingfang Xu, Gu Xu ACM international conference on web search and data mining . 2011

机译：罕见查询的学习相似函数
5. QUERY-FOCUSED EXTRACTIVE SUMMARIZATION BASED ON DEEP LEARNING: COMPARISON OF SIMILARITY MEASURES FOR PSEUDO GROUND TRUTH GENERATION [D] . Yuliska 2019

机译：基于深度学习的查询重点摘要：伪地面真相生成相似度量的比较
6. Searching for rare diseases in PubMed: a blind comparison of Orphanet expert query and query based on terminological knowledge [O] . N. Griffon, M. Schuers, F. Dhombres, 2016

机译：搜索PubMed中的罕见疾病：Orphanet专家查询和基于术语知识的查询的盲目比较
7. Audio Query by Example Using Similarity Measures between Probability Density Functions of Features [O] . 2009

机译：使用特征的概率密度函数之间的相似性度量通过示例进行音频查询

Learning Similarity Function for Rare Queries

摘要

著录项

相似文献

相关主题

期刊订阅