...
首页> 外文期刊>International journal of information retrieval research >A Roadmap to Integrate Document Clustering in Information Retrieval
【24h】

A Roadmap to Integrate Document Clustering in Information Retrieval

机译:在信息检索中集成文档聚类的路线图

获取原文
获取原文并翻译 | 示例

摘要

The World Wide Web is a large distributed digital information space. The ability to search and retrieve information from the Web efficiently and effectively is an enabling technology for realizing its full potential. Information Retrieval (IR) plays an important role in search engines. Todays most advanced engines use the keyword-based ("bag of words ") paradigm, which has inherent disadvantages. Organizing web search results into clusters facilitates the user's quick browsing of search results. Traditional clustering techniques are inadequate because they do not generate clusters with highly readable names. This paper proposes an approach for web search results in clustering based on a phrase based clustering algorithm. It is an alternative to a single ordered result of search engines. This approach presents a list of clusters to the user. Experimental results verify the method's feasibility and effectiveness.
机译:万维网是一个大型的分布式数字信息空间。有效地从Web搜索和检索信息的能力是一种实现其全部潜能的使能技术。信息检索(IR)在搜索引擎中起着重要的作用。当今最先进的引擎使用基于关键字的(“单词袋”)范例,这具有固有的缺点。将Web搜索结果组织到群集中有助于用户快速浏览搜索结果。传统的聚类技术是不足的,因为它们不会生成具有高度可读名称的聚类。本文提出了一种基于词组聚类算法的网络搜索结果聚类方法。它是搜索引擎的单个排序结果的替代方法。这种方法向用户显示了群集列表。实验结果证明了该方法的可行性和有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号