首页> 外文会议>Web technologies and applications >Key Concepts Identification and Weighting in Search Engine Queries
【24h】

Key Concepts Identification and Weighting in Search Engine Queries

机译:搜索引擎查询中的关键概念识别和权重

获取原文
获取原文并翻译 | 示例

摘要

It has been widely observed that queries of search engine are becoming longer and closer to natural language. Actually, current search engines do not perform well with natural language queries. Accurately discovering the key concepts of these queries can dramatically improve the effectiveness of search engines. It has been shown that queries seem to be composed in a way that how users summarize documents, which is so much similar to anchor texts. In this paper, we present a technique for automatic extraction of key concepts from queries with anchor texts analysis. Compared with using web counts of documents, we proposed a supervised machine learning model to classify the concepts of queries into 3 sets according to their importance and types. In the end of this paper, we also demonstrate that our method has remarkable improvement over the baseline.
机译:广泛观察到,搜索引擎的查询越来越长,越来越接近自然语言。实际上,当前的搜索引擎在使用自然语言查询时效果不佳。准确发现这些查询的关键概念可以大大提高搜索引擎的效率。已经显示出查询似乎以用户如何汇总文档的方式构成,与锚文本非常相似。在本文中,我们提出了一种通过锚文本分析从查询中自动提取关键概念的技术。与使用文档的Web计数相比,我们提出了一种有监督的机器学习模型,根据查询的重要性和类型将查询的概念分为3组。在本文的最后,我们还证明了我们的方法相对于基线具有显着的改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号