首页> 外文会议>Knowledge Discovery and Data Mining, 2010. WKDD '10 >Attributes Scaling for K-means Algorithm Controlled by Misclassification of All Clusters
【24h】

Attributes Scaling for K-means Algorithm Controlled by Misclassification of All Clusters

机译:所有聚类分类错误控制的K均值算法的属性缩放

获取原文

摘要

K-means clustering is one of the well-known distance-based clustering methods which partitions data into distinct groups. To implement an automatic attribute-scaled K-mean algorithm, the concept of classification has been integrated. Data points which belong to the same target class are considered similar in K-means clustering. In this paper, we explore and determine the optimal attribute-scaled vector that minimizes misclassification error of the target class. This paper uses the non-linear unconstrained optimization technique in attribute-scaled space, called the cyclic coordinate method together with the golden section line search to find the optimal vector. Our experiments show that the methods can provide the optimal scaling vectors which effectively reduce the misclassification error of supervised K-means clustering and lead to the effective supervised clustering in some data sets.
机译:K均值聚类是众所周知的基于距离的聚类方法之一,该方法将数据划分为不同的组。为了实现自动按比例缩放的K均值算法,已经集成了分类的概念。属于相同目标类别的数据点在K均值聚类中被认为是相似的。在本文中,我们探索并确定了最佳的属性缩放矢量,该矢量可最大程度地减少目标类别的错误分类错误。本文在属性可缩放空间中使用非线性无约束优化技术,称为循环坐标法和黄金分割线搜索,以找到最佳矢量。我们的实验表明,这些方法可以提供最佳的缩放向量,从而有效地减少了监督K均值聚类的误分类误差,并在某些数据集中导致了有效的监督聚类。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号