【24h】

Algorithm for Determining Number of Clusters based on Dichotomy

机译:基于二分法确定簇数的算法

获取原文

摘要

In current clustering algorithms, the number of clusters must be given in advance. And for a given cluster number K has a great impact on the effect of clustering. However, in practice, the specified cluster number K is mostly determined by experience, while partial use of elbow method to find the optimal cluster number requires high time complexity, and the effect is not always optimal. Therefore, there is no unified and effective method to determine the value of cluster number K. To solve this problem, a method to find the number of clusters is proposed. By dichotomizing the current node data each time, the corresponding contour coefficient is obtained, and the size of the contour coefficient before and after calculation is used to judge whether the clustering needs to continue and whether the current node needs to continue adding children nodes until all nodes cannot continue to generate children nodes, and finally a binary tree is formed. And the final number of leaf nodes n of the generated binary tree is taken as the output of the cluster number of the cluster. The experimental results show that the algorithm performs better in the selection of final cluster number and algorithm time.
机译:在当前聚类算法中,必须提前给出群集的数量。对于给定的群集,K对聚类的影响有很大影响。但是,在实践中,指定的群集号k主要由经验决定,而部分使用肘部方法找到最佳簇数需要高时间复杂度,并且效果并不总是最佳的。因此,没有统一和有效的方法来确定簇数K的值。为了解决这个问题,提出了一种找到群集数量的方法。通过每次对当前节点数据进行二分,获得相应的轮廓系数,并且计算之前和之后的轮廓系数的大小用于判断群集是否需要继续以及当前节点是否需要继续添加儿童节点,直到所有节点无法继续生成子节点,最后形成二进制树。并且生成的二叉树的最终数量的叶节点n是群集的群集号的输出。实验结果表明,该算法在选择最终簇数和算法时间方面更好地执行。

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号