首页> 外文期刊>Communications in Statistics >An iterative algorithm for optimal variable weighting in K-means clustering
【24h】

An iterative algorithm for optimal variable weighting in K-means clustering

机译:K-Means群集中最佳变量加权的迭代算法

获取原文
获取原文并翻译 | 示例
           

摘要

The K-means clustering method is a widely adopted clustering algorithm in data mining and pattern recognition, where the partitions are made by minimizing the total within group sum of squares based on a given set of variables. Weighted K-means clustering is an extension of the K-means method by assigning nonnegative weights to the set of variables. In this paper, we aim to obtain more meaningful and interpretable clusters by deriving the optimal variable weights for weighted K-means clustering. Specifically, we improve the weighted k-means clustering method by introducing a new algorithm to obtain the globally optimal variable weights based on the Karush-Kuhn-Tucker conditions. We present the mathematical formulation for the clustering problem, derive the structural properties of the optimal weights, and implement an recursive algorithm to calculate the optimal weights. Numerical examples on simulated and real data indicate that our method is superior in both clustering accuracy and computational efficiency.
机译:K-means聚类方法是数据挖掘和模式识别中广泛采用的聚类算法,其中通过基于给定的一组变量最小化群体的总和内的总和来进行分区。加权k-means聚类是通过将非负重量分配给该组变量来扩展k-means方法。在本文中,我们的目标是通过导出加权K-means聚类的最佳变量权重来获得更有意义和可解释的群集。具体地,我们通过引入一种新的算法来提高加权K-Means聚类方法,以基于Karush-Kuhn-Tucker条件获得全局最佳的可变权重。我们介绍了聚类问题的数学制定,导出了最佳权重的结构特性,并实现了递归算法来计算最佳权重。模拟和实数据的数值例子表明我们的方法均以聚类精度和计算效率优异。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号