
Chinese Query Reformulation And Variation: a Case Study in Sogou Log


Abstract

Query reformulation and query variation are two of the most difficult and essential aspects of information seeking and retrieval. The existing literature investigates only changes in the number of terms across successive queries; it does not analyze which positions users modify or which reformulation patterns they follow. In this paper, we first propose 8 reformulation classes, defined by the position modified and the change in the number of query terms, to analyze query reformulation. Experimental results show that Chinese users prefer to partially reformulate queries into shorter ones. Second, we define the notion of a reformulation pattern and look for the pattern that Chinese users use most frequently. We find that a user who reformulates a query only once in a session is most likely to add terms at the two ends of the query. Finally, we define query variation as the differences between queries that share the same search intent. According to the Sogou log, queries at different levels of abstraction are the most likely to lead to the same click-through set despite being different queries. These findings benefit the study of information retrieval modeling and can help improve the accuracy of preference prediction.
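The two analyses described in the abstract can be illustrated with a minimal Python sketch. It rests on several assumptions: the abstract does not enumerate the 8 reformulation classes, so the position labels below ("start", "end", "middle", "both_ends", "total") are illustrative placeholders, not the paper's taxonomy. The function names `describe_reformulation` and `group_by_click_set` are hypothetical, queries are assumed to be already segmented into term lists (Chinese queries need a word segmenter first), and the click log is assumed to be a sequence of (query, clicked URL) pairs, which is not necessarily the Sogou log format.

```python
from collections import defaultdict
from typing import Dict, FrozenSet, Iterable, List, Set, Tuple


def describe_reformulation(prev: List[str], curr: List[str]) -> Tuple[str, str]:
    """Return (modified_position, length_change) for two successive queries.

    The labels are illustrative assumptions, not the paper's 8 classes.
    """
    shortest = min(len(prev), len(curr))

    # Count the leading terms that both queries share.
    prefix = 0
    while prefix < shortest and prev[prefix] == curr[prefix]:
        prefix += 1

    # Count the shared trailing terms, without overlapping the prefix.
    suffix = 0
    while suffix < shortest - prefix and prev[-1 - suffix] == curr[-1 - suffix]:
        suffix += 1

    if prefix == len(prev) == len(curr):
        position = "none"        # identical queries
    elif prefix > 0 and suffix > 0:
        position = "middle"      # both ends kept, interior changed
    elif prefix > 0:
        position = "end"         # start kept, end changed
    elif suffix > 0:
        position = "start"       # end kept, start changed
    elif set(prev) & set(curr):
        position = "both_ends"   # some terms kept, but neither end
    else:
        position = "total"       # no term in common

    if len(curr) > len(prev):
        change = "longer"
    elif len(curr) < len(prev):
        change = "shorter"
    else:
        change = "same"
    return position, change


def group_by_click_set(clicks: Iterable[Tuple[str, str]]) -> List[Set[str]]:
    """Group distinct queries whose sets of clicked URLs are identical."""
    urls_by_query: Dict[str, Set[str]] = defaultdict(set)
    for query, url in clicks:
        urls_by_query[query].add(url)

    queries_by_urls: Dict[FrozenSet[str], Set[str]] = defaultdict(set)
    for query, urls in urls_by_query.items():
        queries_by_urls[frozenset(urls)].add(query)

    # Keep only click sets reached by more than one distinct query.
    return [qs for qs in queries_by_urls.values() if len(qs) > 1]
```

Under these assumptions, adding a term on each side of a query, for example going from `["b", "c"]` to `["a", "b", "c", "d"]`, is reported as a `both_ends` change that makes the query longer. Exact click-set equality is a strict stand-in for "same search intent"; a real study might prefer a looser overlap criterion.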

Bibliographic details

  • Source
    Journal of Information and Computational Science, 2010, No. 1, pp. 251-257 (7 pages)
  • Author affiliations

    Lab of Machine Intelligence and Translation, School of Computer Science and Technology, Harbin Institute of Technology, Harbin 150001, China;

    Lab of Machine Intelligence and Translation, School of Computer Science and Technology, Harbin Institute of Technology, Harbin 150001, China;

    Lab of Machine Intelligence and Translation, School of Computer Science and Technology, Harbin Institute of Technology, Harbin 150001, China;

    Lab of Machine Intelligence and Translation, School of Computer Science and Technology, Harbin Institute of Technology, Harbin 150001, China;

    Department of Computer Science and Technology, Heilongjiang Institute of Technology, Harbin 150001, China;

  • Original format: PDF
  • Language: English
  • Keywords

    log analysis; query differences; query reformulation; Chinese users
