首页> 外文会议>International Conference on Machine Learning and Computing >Genetically Improved PSO Algorithm for Efficient Data Clustering
【24h】

Genetically Improved PSO Algorithm for Efficient Data Clustering

机译:基因改进的PSO算法用于高效数据聚类

获取原文

摘要

Clustering is an important research topic in data mining that appears in a wide range of unsupervised classification applications. Partitional clustering algorithms such as the k-means algorithm are the most popular for clustering large datasets. The major problem with the k-means algorithm is that it is sensitive to the selection of the initial partitions and it may converge to local optima. In this paper, we present a hybrid two-phase GAI-PSO+k-means data clustering algorithm that performs fast data clustering and can avoid premature convergence to local optima. In the first phase we utilize the new genetically improved particle swarm optimization algorithm (GAI-PSO) which is a population-based heuristic search technique modeled on the hybrid of cultural and social rules derived from the analysis of the swarm intelligence (PSO) and the concepts of natural selection and evolution (GA). The GAI-PSO combines the standard velocity and position update rules of PSOs with the ideas of selection, mutation and crossover from GAs. The GAI-PSO algorithm searches the solution space to find the optimal initial cluster centroids for the next phase. The second phase is a local refining stage utilizing the k-means algorithm which can efficiently converge to the optimal solution. The proposed algorithm combines the ability of the globalized searching of the evolutionary algorithms and the fast convergence of the k-means algorithm and can avoid the drawback of both. The performance of the proposed algorithm is evaluated through several benchmark datasets. The experimental results show that the proposed algorithm is highly forceful and outperforms the previous approaches such as SA, ACO, PSO and k-means for the partitional clustering problem.
机译:群集是数据挖掘中的一个重要研究主题,这些主题出现在广泛的无监督的分类应用程序中。诸如K-means算法之类的分区聚类算法是群集大型数据集最受欢迎的算法。 K-means算法的主要问题是它对初始分区的选择敏感,并且它可能会聚到本地Optima。在本文中,我们介绍了一个混合的两相GAI-PSO + K-means数据聚类算法,执行快速数据聚类,可以避免对本地Optima的过早汇聚。在第一阶段,我们利用新的遗传改进的粒子群优化算法(GAI-PSO),该算法(GAI-PSO)是一种基于人口的启发式搜索技术,这些搜索技术模仿了来自群体智能(PSO)的分析的文化和社会规则的混合和自然选择和进化的概念(GA)。 GAI-PSO将PSO的标准速度和位置更新规则与来自天然气的选择,突变和交叉的思想相结合。 GAI-PSO算法搜索解决方案空间以查找下一阶段的最佳初始集群质心。第二阶段是利用K-means算法的本地精炼阶段,其可以有效地收敛到最佳解决方案。该算法结合了全球化搜索进化算法的能力和K-Means算法的快速收敛性,可以避免两者的缺点。通过多个基准数据集进行评估所提出的算法的性能。实验结果表明,该算法非常有力,优于先前的方法,例如SA,ACO,PSO和K-inse的分配问题。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号