【24h】

User-oriented document clustering

机译:面向用户的文档集群

获取原文

摘要

In information retrieval, cluster analysis is an important tool employed to enhance both efficiency and effectiveness of the retrieval process. Most clustering algorithms have difficulty in reflecting the closeness of documents as perceived by the user. A two phase scheme for document clustering, whose results reflect the "conceptual" clusters that are perceived by the user of the retrieval system, is proposed. Since the clusters obtained by this scheme are not characterized in terms of the document representations, a strategy for cluster searching is also developed. Both the proposed document clustering scheme and document searching strategy are experimentally evaluated using a test collection from the SMART system. The preliminary experimental results obtained are very encouraging.

机译:

在信息检索中,聚类分析是用来提高检索过程的效率和有效性的重要工具。大多数聚类算法很难反映出用户所感知的文档的紧密程度。提出了一种两阶段的文档聚类方案,其结果反映了检索系统的用户可以感知的“概念”聚类。由于通过该方案获得的聚类没有根据文档表示来表征,因此还开发了一种聚类搜索策略。拟议的文档聚类方案和文档搜索策略均使用来自SMART系统的测试集合进行了实验评估。初步的实验结果令人鼓舞。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号