首页> 外文会议>International Conference on Computer Processing of Oriental Languages >Clique Percolation Method for Finding Naturally Cohesive and Overlapping Document Clusters
【24h】

Clique Percolation Method for Finding Naturally Cohesive and Overlapping Document Clusters

机译:用于查找天然凝聚力和重叠文件集群的集团渗透方法

获取原文

摘要

Techniques for find document clusters mostly depend on models that impose strong explicit and/or implicit priori assumptions. As a consequence, the clustering effects tend to be unnatural and stray away from the intrinsic grouping natures of a document collection. We apply a novel graph-theoretic technique called Clique Percolation Method (CPM) for document clustering. In this method, a process of enumerating highly cohesive maximal document cliques is performed in a random graph, where those strongly adjacent cliques are mingled to form naturally overlapping clusters. Our clustering results can unveil the inherent structural connections of the underlying data. Experiments show that CPM can outperform some typical algorithms on benchmaxk data sets, and shed light on its advantages on natural document clustering.
机译:查找文档群集的技术主要取决于强大的显式和/或隐式先验假设的模型。因此,聚类效应往往是不自然的,并且远离文件收集的内在分组自然。我们应用一种名为Clique Percolation方法(CPM)的新型图形理论技术,用于文档聚类。在该方法中,在随机图中进行枚举高粘合的最大文件群的过程,其中强烈相邻的群体混合以形成天然的重叠簇。我们的聚类结果可以揭示底层数据的固有结构连接。实验表明,CPM可以在Benchmaxk数据集上优于一些典型的算法,并阐明其在自然文档聚类上的优势。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号