首页> 外文期刊>Oriental Journal of Computer Science and Technology >On the Consequence of Variation Measure in K- modes Clustering Algorithm
【24h】

On the Consequence of Variation Measure in K- modes Clustering Algorithm

机译:K-模式聚类算法中变化测度的结果

获取原文
获取原文并翻译 | 示例
           

摘要

Organizing data into sensible groupings is one of the most fundamental modes of understanding and learning Clustering is one of the most important data mining techniques that partitions data according to some similarity criterion. The problems of clustering categorical data have attracted much attention from the data mining research community recently.The original k-means algorithm or known as Lloyd's algorithm, is designed to work primarily on numeric data sets. This prohibits the algorithm from being applied to definite data clustering, which is an integral part of data mining and has attracted much attention recently In this paper delineates increase to the k-modes algorithm for clustering definite data. By modifying a simple corresponding Variation measure for definite entities, a heuristic approach was developed in, which allows the use of the k-modes paradigm to obtain a cluster with strong intra-similarity, and to efficiently cluster large definite data sets. The main aim of this paper is to derive severely the updating formula of the k-modes clustering algorithm with the new Variation measure, and the convergence of the algorithm under the optimization framework.
机译:将数据组织成明智的分组是理解和学习的最基本模式之一。聚类是最重要的数据挖掘技术之一,它可以根据一些相似性准则对数据进行分区。最近,分类数据的聚类问题引起了数据挖掘研究界的广泛关注。最初的k均值算法或称为Lloyd算法的设计主要用于数字数据集。这阻碍了该算法被应用于确定数据聚类,这是数据挖掘不可或缺的一部分,并且近来引起了很多关注。本文描述了对确定数据聚类的k模式算法的增加。通过修改用于确定实体的简单对应变化方法,开发了一种启发式方法,该方法允许使用k模式范式来获得具有强内部相似性的聚类,并有效地聚类大的确定数据集。本文的主要目的是通过新的变分测度严格推导k模式聚类算法的更新公式,以及在优化框架下算法的收敛性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号