首页> 外文会议>International Conference on Soft Computing and Machine Intelligence >An enhanced clustering analysis based on glowworm swarm optimization
【24h】

An enhanced clustering analysis based on glowworm swarm optimization

机译:基于萤火虫群优化的增强聚类分析

获取原文

摘要

Data clustering has always been an important aspect of data mining. Extracting clusters from data could be very difficult especially when the features are large and the classes not clearly partitioned, hence the need for high-quality clustering techniques. The major shortcoming of various clustering techniques is that the number of clusters must be stated before the clustering starts. A recent successful work in clustering is the Clustering analysis based on Glowworm Swarm Optimization (CGSO) algorithm. CGSO uses the multimodal search capacity of the Glowworm Swarm Optimization (GSO) algorithm to automatically figure out clusters within a data set without prior knowledge about the number of clusters. However, the sensor range - one of the parameters of the CGSO algorithm and a determinant of the number of clusters as well as the cluster quality - is in fact obtained by trial and error, which is clearly an inefficient approach. Consequently, this paper proposes the Modified Clustering analysis based on Glowworm Swarm Optimization (CGSOm) algorithm. The CGSOm extends the CGSO by incorporating a mechanism that determines the sensor range efficiently and automatically, modifying the glowworm initialization method and introducing a function that measures the cluster error during the iteration phase. The proposed algorithm was tested on artificial and real-world data sets. Experimental results show that for most data sets, the proposed CGSOm algorithm gives better clustering quality results of entropy and purity values when compared with the original CGSO algorithm and four standard clustering algorithms commonly used in the literature. The results reveal that the CGSOm yields better quality clusters.
机译:数据群集始终是数据挖掘的一个重要方面。从数据中提取群集可能是非常困难的,特别是当特征大而且没有明确分区的类时,因此需要高质量的聚类技术。各种聚类技术的主要缺点是必须在聚类开始之前陈述群集数量。最近在聚类的成功工作是基于萤火虫群优化(CGSO)算法的聚类分析。 CGSO使用萤火虫群优化(GSO)算法的多模式搜索容量来自动在数据集中省略群集,而无需先验知识的群集。然而,传感器范围 - CGSO算法的参数之一和集群数量的决定因素以及群集质量 - 实际上是通过试验和误差获得的,这显然是一种低效的方法。因此,本文提出了基于萤火虫群优化(CGSOM)算法的改进的聚类分析。 CGSOM通过结合有效和自动确定传感器范围的机制来扩展CGSOS,修改萤火虫初始化方法并引入迭代阶段期间测量集群错误的功能。在人工和现实世界数据集上测试了所提出的算法。实验结果表明,对于大多数数据集,所提出的CGSOM算法在与原始CGSO算法和文献中常用的四个标准聚类算法相比,熵和纯度值的更好的聚类质量结果。结果表明,CGSOM产生更好的质量簇。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号