Query Type Classification for Web Document Retrieval

Abstract

The heterogeneous Web exacerbates IR problems, and short user queries make them worse. The contents of Web documents alone are not enough to find good answer documents. Link information and URL information compensate for the insufficiencies of content information. However, a static combination of multiple evidences may lower retrieval performance. We need different strategies to find target documents according to the query type. User queries can be classified into three categories: the topic relevance task, the homepage finding task, and the service finding task. In this paper, a user query classification scheme is proposed. This scheme uses the difference of distribution, mutual information, the usage rate as anchor texts, and POS information for the classification. After classifying a user query, we apply different algorithms and information to obtain better results. For the topic relevance task, we emphasize the content information; for the homepage finding task, on the other hand, we emphasize the link information and the URL information. We obtained the best performance when our proposed classification method was used with the OKAPI scoring algorithm.
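
The idea of weighting evidence differently per query type can be illustrated with a minimal sketch. It is not the authors' implementation: the names `Evidence`, `WEIGHTS`, `combined_score`, and `rank` are hypothetical, the weight values are made up for illustration (the abstract gives none), and the query type is assumed to have been decided already by some classifier. The service finding task is left out because the abstract does not say which evidence it emphasizes.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class Evidence:
    """Per-document scores for one query (all values are assumed inputs)."""
    content: float  # e.g. an Okapi-style content score
    link: float     # score derived from link information
    url: float      # score derived from URL information


# Hypothetical weights: content only for topic relevance,
# more link and URL weight for homepage finding.
WEIGHTS: Dict[str, Dict[str, float]] = {
    "topic_relevance": {"content": 1.0, "link": 0.0, "url": 0.0},
    "homepage_finding": {"content": 0.4, "link": 0.3, "url": 0.3},
}


def combined_score(ev: Evidence, query_type: str) -> float:
    """Linear combination of evidences, with weights chosen by query type."""
    w = WEIGHTS[query_type]
    return w["content"] * ev.content + w["link"] * ev.link + w["url"] * ev.url


def rank(docs: Dict[str, Evidence], query_type: str) -> List[str]:
    """Return document ids sorted by descending combined score."""
    return sorted(
        docs,
        key=lambda doc_id: combined_score(docs[doc_id], query_type),
        reverse=True,
    )


docs = {
    "deep_page": Evidence(content=0.9, link=0.1, url=0.1),
    "home": Evidence(content=0.5, link=0.9, url=0.9),
}
topic_ranking = rank(docs, "topic_relevance")
homepage_ranking = rank(docs, "homepage_finding")
```

With these toy scores, the same two documents are ordered differently depending on the query type, which is the effect a static combination of evidences cannot produce.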

Bibliographic details

Source
The Twenty-Sixth Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Jul 28-Aug 1, 2003, Toronto, Canada | 2003 | pp. 64-71 | 8 pages
Conference location: Toronto (CA)
Authors
In-Ho Kang; GilChang Kim;
Author affiliation

Division of Computer Science, Department of EECS, KAIST

Original format: PDF
Language: English
Chinese Library Classification: Science, scientific research
Keywords
combination of multiple evidences; link information; URL information; query classification;
