...
首页> 外文期刊>Wuhan University Journal of Natural Sciences >Automatic User Goals Identification Based on Anchor Text and Click-Through Data
【24h】

Automatic User Goals Identification Based on Anchor Text and Click-Through Data

机译:基于锚文本和点击数据的自动用户目标识别

获取原文
获取原文并翻译 | 示例
           

摘要

Understanding the underlying goal behind a user's Web query has been proved to be helpful to improve the quality of search. This paper focuses on the problem of automatic identification of query types according to the goals. Four novel entropy-based features extracted from anchor data and click-through data are proposed, and a support vector machines (SVM) classifier is used to identify the user's goal based on these features. Experimental results show that the proposed entropy-based features are more effective than those reported in previous work. By combining multiple features the goals for more than 97% of the queries studied can be correctly identified. Besides these, this paper reaches the following important conclusions: First, anchor-based features are more effective than click-through-based features; Second, the number of sites is more reliable than the number of links; Third, click-distribution- based features are more effective than session-based ones.
机译:事实证明,了解用户的Web查询背后的基本目标有助于提高搜索质量。本文着重于根据目标自动识别查询类型的问题。提出了四个从锚点数据和点击数据中提取的基于熵的新颖特征,并使用支持向量机(SVM)分类器基于这些特征识别用户的目标。实验结果表明,所提出的基于熵的特征比以前的工作更有效。通过组合多个功能,可以正确确定研究的97%以上查询的目标。除此之外,本文还得出以下重要结论:首先,基于锚的功能比基于点击的功能更有效。其次,站点数量比链接数量更可靠;第三,基于点击分配的功能比基于会话的功能更有效。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号