首页> 外文期刊>ACM transactions on intelligent systems >Mining Search and Browse Logs for Web Search: A Survey
【24h】

Mining Search and Browse Logs for Web Search: A Survey

机译:挖掘搜索和浏览日志以进行Web搜索:一项调查

获取原文
获取原文并翻译 | 示例
       

摘要

Huge amounts of search log data have been accumulated at Web search engines. Currently, a popular Web search engine may receive billions of queries and collect terabytes of records about user search behavior daily. Beside search log data, huge amounts of browse log data have also been collected through client-side browser plugins. Such massive amounts of search and browse log data provide great opportunities for mining the wisdom of crowds and improving Web search. At the same time, designing effective and efficient methods to clean, process, and model log data also presents great challenges. In this survey, we focus on mining search and browse log data for Web search. We start with an introduction to search and browse log data and an overview of frequently-used data summarizations in log mining. We then elaborate how log mining applications enhance the five major components of a search engine, namely, query understanding, document understanding, document ranking, user understanding, and monitoring and feedback. For each aspect, we survey the major tasks, fundamental principles, and state-of-the-art methods.
机译:Web搜索引擎上已积累了大量的搜索日志数据。当前,流行的Web搜索引擎每天可能会收到数十亿条查询,并收集有关用户搜索行为的TB级记录。除了搜索日志数据外,还通过客户端浏览器插件收集了大量的浏览日志数据。如此大量的搜索和浏览日志数据为挖掘人群的智慧并改善Web搜索提供了巨大的机会。同时,设计有效,高效的方法来清理,处理和建模日志数据也带来了巨大的挑战。在本次调查中,我们专注于挖掘搜索并浏览用于Web搜索的日志数据。我们首先介绍搜索和浏览日志数据,并概述日志挖掘中常用的数据摘要。然后,我们详细介绍日志挖掘应用程序如何增强搜索引擎的五个主要组成部分,即查询理解,文档理解,文档排名,用户理解以及监视和反馈。对于每个方面,我们都会调查主要任务,基本原理和最新方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号