首页> 外文期刊>IAENG Internaitonal journal of computer science >Adaptive Clustering Algorithm Based on Max-min Distance and Bayesian Decision Theory
【24h】

Adaptive Clustering Algorithm Based on Max-min Distance and Bayesian Decision Theory

机译:基于最大最小距离和贝叶斯决策理论的自适应聚类算法

获取原文
获取原文并翻译 | 示例
           

摘要

K-means clustering algorithm is one of the most famous partitioning clustering techniques that have been widely applied in many fields. Although it is very simple and fast in the process of clustering, the method suffers from a few drawbacks. K-means clustering algorithm requires to specifying the number of clusters which is difficult to know in advance for many real data sets. In addition, K-means clustering algorithm often leads to different clustering results because initial seeds are chosen randomly. To solve these problems, this paper proposes an adaptive clustering algorithm. The new algorithm adopts the idea of continuous partition of a given data set. In the process of each partition, the algorithm can select initial seeds based on max-min distance to obtain a certain result of clustering, and it can evaluate the risk of the clustering result by extending Bayesian decision theory to the field of clustering. Comparing the risk values before and after partitioning, the algorithm can decide whether the data set is continue partitioned, thus it can determine the number of clusters and get the final result of clustering automatically. The performance of the proposed algorithm has been studied on some synthetic and real world data sets. The experimental results illustrate that the new algorithm, without parameter specified by users in advance, is able to obtain efficient clustering results.
机译:K均值聚类算法是最著名的分区聚类技术之一,已广泛应用于许多领域。尽管在聚类过程中它非常简单和快速,但是该方法具有一些缺点。 K-means聚类算法需要指定对于许多实际数据集来说很难事先知道的聚类数。另外,由于随机选择初始种子,因此K均值聚类算法经常导致不同的聚类结果。为了解决这些问题,本文提出了一种自适应聚类算法。新算法采用了连续分割给定数据集的想法。在每个分区的过程中,该算法可以基于最大-最小距离选择初始种子以获得一定的聚类结果,并且可以通过将贝叶斯决策理论扩展到聚类领域来评估聚类结果的风险。通过比较分区前后的风险值,该算法可以确定数据集是否继续分区,从而可以确定聚类数量并自动获得聚类的最终结果。已经在一些合成的和真实的数据集上研究了所提出算法的性能。实验结果表明,该算法无需用户预先指定参数,就能获得有效的聚类结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号