首页> 外文期刊>Journal of Artificial Intelligence and Soft Computing Research >Clustering Large-Scale Data Based On Modified Affinity Propagation Algorithm
【24h】

Clustering Large-Scale Data Based On Modified Affinity Propagation Algorithm

机译:基于改进的亲和力传播算法的大规模数据聚类

获取原文
           

摘要

Traditional clustering algorithms are no longer suitable for use in data mining applications that make use of large-scale data. There have been many large-scale data clustering algorithms proposed in recent years, but most of them do not achieve clustering with high quality. Despite that Affinity Propagation (AP) is effective and accurate in normal data clustering, but it is not effective for large-scale data. This paper proposes two methods for large-scale data clustering that depend on a modified version of AP algorithm. The proposed methods are set to ensure both low time complexity and good accuracy of the clustering method. Firstly, a data set is divided into several subsets using one of two methods random fragmentation or K-means. Secondly, subsets are clustered into K clusters using K-Affinity Propagation (KAP) algorithm to select local cluster exemplars in each subset. Thirdly, the inverse weighted clustering algorithm is performed on all local cluster exemplars to select well-suited global exemplars of the whole data set. Finally, all the data points are clustered by the similarity between all global exemplars and each data point. Results show that the proposed clustering method can significantly reduce the clustering time and produce better clustering result in a way that is more effective and accurate than AP, KAP, and HAP algorithms.
机译:传统的聚类算法不再适合在使用大规模数据的数据挖掘应用程序中使用。近年来,已经提出了许多大规模的数据聚类算法,但是大多数算法不能实现高质量的聚类。尽管亲和传播(AP)在常规数据聚类中是有效且准确的,但对于大规模数据却无效。本文提出了两种依赖于AP算法修改版本的大规模数据聚类方法。所提出的方法被设置为确保低时间复杂度和聚类方法的良好准确性。首先,使用随机分段或K均值的两种方法之一将数据集划分为几个子集。其次,使用K-Affinity Propagation(KAP)算法将子集聚类为K个聚类,以选择每个子集中的局部聚类样本。第三,对所有局部聚类样本执行逆加权聚类算法,以选择整个数据集的合适的全局样本。最后,所有数据点都通过所有全局样本与每个数据点之间的相似性进行聚类。结果表明,与AP,KAP和HAP算法相比,所提出的聚类方法可以显着减少聚类时间,并产生更好的聚类结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号