首页> 外文会议>2017 IEEE 4th International Conference on Soft Computing amp; Machine Intelligence >An enhanced clustering analysis based on glowworm swarm optimization
【24h】

An enhanced clustering analysis based on glowworm swarm optimization

机译:基于萤火虫优化的增强聚类分析

获取原文
获取原文并翻译 | 示例

摘要

Data clustering has always been an important aspect of data mining. Extracting clusters from data could be very difficult especially when the features are large and the classes not clearly partitioned, hence the need for high-quality clustering techniques. The major shortcoming of various clustering techniques is that the number of clusters must be stated before the clustering starts. A recent successful work in clustering is the Clustering analysis based on Glowworm Swarm Optimization (CGSO) algorithm. CGSO uses the multimodal search capacity of the Glowworm Swarm Optimization (GSO) algorithm to automatically figure out clusters within a data set without prior knowledge about the number of clusters. However, the sensor range - one of the parameters of the CGSO algorithm and a determinant of the number of clusters as well as the cluster quality - is in fact obtained by trial and error, which is clearly an inefficient approach. Consequently, this paper proposes the Modified Clustering analysis based on Glowworm Swarm Optimization (CGSOm) algorithm. The CGSOm extends the CGSO by incorporating a mechanism that determines the sensor range efficiently and automatically, modifying the glowworm initialization method and introducing a function that measures the cluster error during the iteration phase. The proposed algorithm was tested on artificial and real-world data sets. Experimental results show that for most data sets, the proposed CGSOm algorithm gives better clustering quality results of entropy and purity values when compared with the original CGSO algorithm and four standard clustering algorithms commonly used in the literature. The results reveal that the CGSOm yields better quality clusters.
机译:数据集群一直是数据挖掘的重要方面。从数据中提取聚类可能非常困难,尤其是当特征较大且类别未明确划分时,因此需要高质量的聚类技术。各种聚类技术的主要缺点是必须在聚类开始之前说明聚类的数量。聚类的最新成功工作是基于萤火虫群优化算法(CGSO)的聚类分析。 CGSO使用萤火虫群优化(GSO)算法的多模式搜索功能来自动找出数据集中的簇,而无需事先知道簇的数量。但是,传感器范围是CGSO算法的参数之一,它是确定簇数以及簇质量的决定因素,实际上是通过反复试验获得的,这显然是一种无效的方法。因此,本文提出了一种基于萤火虫群优化算法的改进聚类分析方法。 CGSOm通过合并一种机制来扩展CGSO,该机制可以自动有效地确定传感器范围,修改萤火虫初始化方法并引入在迭代阶段测量簇误差的函数。该算法在人工和真实数据集上进行了测试。实验结果表明,与原始CGSO算法和文献中常用的四种标准聚类算法相比,所提出的CGSOm算法对于熵和纯度值的聚类质量结果更好。结果表明,CGSOm产生更好的质量簇。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号