首页> 外文会议>International Conference on Intelligent Human-Machine Systems and Cybernetics >A Comparative Study of Selective Cluster Ensemble for Document Clustering
【24h】

A Comparative Study of Selective Cluster Ensemble for Document Clustering

机译:用于文档聚类的选择性聚类集成的比较研究

获取原文

摘要

Spherical K-means algorithm (SKM) has been widely used for document clustering. Yet it is prone to getting stuck at local optimums. Recently, cluster ensemble has shown to be an effective method in improving the performance of single clustering algorithm such as SKM by combining multiple clustering solutions resulted by SKM. This paper describes and compares three graph partitioning algorithms and four hierarchy clustering algorithms for document clustering, both theoretically and empirically, using a selective cluster ensemble framework. Our experimental results over several document datasets show that, in terms of normalized mutual information, (a) The hierarchy clustering algorithms produce better clustering results than graph partitioning algorithms (b) on most datasets, the best results are arrived when the size of ensemble is between 40 and 60.
机译:球形K均值算法(SKM)已广泛用于文档聚类。但是,它很容易陷入局部最优状态。最近,集群集成已被证明是一种有效的方法,可以通过组合SKM产生的多个聚类解决方案来提高诸如SKM之类的单个聚类算法的性能。本文在理论上和经验上都使用选择性聚类集成框架描述和比较了三种用于文档聚类的图分区算法和四种层次聚类算法。我们在几个文档数据集上的实验结果表明,就归一化的互信息而言,(a)在大多数数据集上,层次聚类算法产生的聚类结果优于图分区算法(b),当集合的大小为在40至60之间。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号