首页> 外文会议>International world wide web conference;WWW 09 >Towards Context-Aware Search by Learning A Very Large Variable Length Hidden Markov Model from Search Logs
【24h】

Towards Context-Aware Search by Learning A Very Large Variable Length Hidden Markov Model from Search Logs

机译:通过从搜索日志中学习非常大的可变长度隐马尔可夫模型来实现上下文感知搜索

获取原文

摘要

Capturing the context of a user's query from the previous queries and clicks in the same session may help understand the user's information need. A context-aware approach to document re-ranking, query suggestion, and URL recommendation may improve users' search experience substantially. In this paper, we propose a general approach to context-aware search. To capture contexts of queries, we learn a variable length Hidden Markov Model (vlHMM) from search sessions extracted from log data. Although the mathematical model is intuitive, how to learn a large vlHMM with millions of states from hundreds of millions of search sessions poses a grand challenge. We develop a strategy for parameter initialization in vlHMM learning which can greatly reduce the number of parameters to be estimated in practice. We also devise a method for distributed vlHMM learning under the map-reduce model. We test our approach on a real data set consisting of 1.8 billion queries, 2.6 billion clicks, and 840 million search sessions, and evaluate the effectiveness of the vlHMM learned from the real data on three search applications: document re-ranking, query suggestion, and URL recommendation. The experimental results show that our approach is both effective and efficient.
机译:从同一会话中的先前查询和单击中捕获用户查询的上下文可能有助于了解用户的信息需求。用于文档重新排名,查询建议和URL推荐的上下文感知方法可以大大改善用户的搜索体验。在本文中,我们提出了一种用于上下文感知搜索的通用方法。为了捕获查询的上下文,我们从从日志数据中提取的搜索会话中学习了可变长度的隐马尔可夫模型(vlHMM)。尽管数学模型是直观的,但如何从数亿个搜索会话中学习具有数百万个状态的大型vHMM构成了巨大的挑战。我们开发了一种用于vlHMM学习中的参数初始化的策略,该策略可以大大减少在实践中要估计的参数数量。我们还设计了一种在map-reduce模型下进行分布式vlHMM学习的方法。我们在包含18亿个查询,26亿次点击和8.4亿个搜索会话的真实数据集上测试了我们的方法,并评估了从以下三个搜索应用程序的真实数据中获取的vlHMM的有效性:文档重新排名,查询建议,和网址推荐。实验结果表明,我们的方法既有效又有效。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号