
A NEW CLUSTER MERGING ALGORITHM OF SUFFIX TREE CLUSTERING


Abstract

Document clustering methods can be used to structure large sets of text or hypertext documents. Suffix Tree Clustering has been shown to be a good approach to document clustering. However, its cluster merging algorithm is based only on the overlap of the clusters' document sets, which completely ignores the similarity between the non-overlapping parts of different clusters. In this paper, we introduce a novel cluster merging approach that combines cosine similarity with overlap percentage. Using this method, we obtain better clustering results and a comparatively small number of clusters.
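
The abstract does not give the merging formula, so the following is only a minimal sketch of how an overlap percentage and a cosine similarity over the non-overlapping parts might be combined into one merge decision. The function names, the weight `alpha`, the `threshold`, the use of summed term counts as cluster vectors, and the choice to normalise overlap by the smaller cluster are all assumptions made for illustration, not the authors' method.

```python
import math
from collections import Counter


def overlap_fraction(docs_a, docs_b):
    """Shared documents as a fraction of the smaller cluster (assumed definition)."""
    if not docs_a or not docs_b:
        return 0.0
    return len(docs_a & docs_b) / min(len(docs_a), len(docs_b))


def cosine(u, v):
    """Cosine similarity between two sparse term-count vectors."""
    dot = sum(u[t] * v[t] for t in u.keys() & v.keys())
    norm_u = math.sqrt(sum(x * x for x in u.values()))
    norm_v = math.sqrt(sum(x * x for x in v.values()))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def summed_terms(doc_ids, term_counts):
    """Sum the term counts of the given documents into one vector."""
    total = Counter()
    for doc_id in doc_ids:
        total.update(term_counts[doc_id])
    return total


def merge_score(docs_a, docs_b, term_counts, alpha=0.5):
    """Weighted mix of overlap and non-overlap similarity (hypothetical weighting)."""
    only_a = docs_a - docs_b
    only_b = docs_b - docs_a
    if only_a and only_b:
        similarity = cosine(summed_terms(only_a, term_counts),
                            summed_terms(only_b, term_counts))
    else:
        # One cluster contains the other, so there is no non-overlapping
        # pair to compare; this sketch treats that case as fully similar.
        similarity = 1.0
    return alpha * overlap_fraction(docs_a, docs_b) + (1 - alpha) * similarity


def should_merge(docs_a, docs_b, term_counts, alpha=0.5, threshold=0.5):
    """Merge two clusters when the combined score reaches the threshold."""
    return merge_score(docs_a, docs_b, term_counts, alpha) >= threshold
```

Under these assumptions, two clusters that share only half of their documents can still be merged when their remaining documents use similar vocabulary, whereas a purely overlap-based rule would treat them the same as two clusters whose remaining documents are unrelated.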
