首页> 外文会议>International World Wide Web Conference >Time-Dependent Semantic Similarity Measure of Queries Using Historical Click-Through Data
【24h】

Time-Dependent Semantic Similarity Measure of Queries Using Historical Click-Through Data

机译:使用历史点击数据数据的查询的时间依赖性语义相似度测量

获取原文

摘要

It has become a promising direction to measure similarity of Web search queries by mining the increasing amount of click-through data logged by Web search engines, which record the interactions between users and the search engines. Most existing approaches employ the click-through data for similarity measure of queries with little consideration of the temporal factor, while the click-through data is often dynamic and contains rich temporal information. In this paper we present a new framework of time-dependent query semantic similarity model on exploiting the temporal characteristics of historical click-through data. The intuition is that more accurate semantic similarity values between queries can be obtained by taking into account the timestamps of the log data. With a set of user-defined calendar schema and calendar patterns, our time-dependent query similarity model is constructed using the marginalized kernel technique, which can exploit both explicit similarity and implicit semantics from the click-through data effectively. Experimental results on a large set of click-through data acquired from a commercial search engine show that our time-dependent query similarity model is more accurate than the existing approaches. Moreover, we observe that our time-dependent query similarity model can, to some extent, reflect real-world semantics such as real-world events that are happening over time.
机译:通过挖掘Web搜索引擎记录的越来越多的点击数据,它已经成为衡量Web搜索查询的相似性的有希望的方向,该网页搜索引擎记录了用户和搜索引擎之间的交互。大多数现有方法采用点击数据,以了解查询的相似性度量,几乎没有考虑时间因子,而点击数据通常是动态的,并且包含丰富的时间信息。本文在利用历史点击数据的时间特征来提高时间依赖查询语义相似性模型的新框架。直觉是通过考虑日志数据的时间戳,可以获得查询之间更准确的语义相似性值。使用一组用户定义的日历模式和日历模式,我们的时间依赖性查询相似性模型是使用边缘化内核技术构造的,可以有效地利用点击式数据来利用显式相似性和隐式语义。从商业搜索引擎获取的大量点击点数据上的实验结果表明,我们的时间依赖查询相似性模型比现有方法更准确。此外,我们观察到我们的时间依赖查询相似之处可以在一定程度上反映现实世界的语义,例如随着时间的推移发生的真实世界事件。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号