首页> 外文会议>1st IEEE International Conference on Power Electronics, Intelligent Control and Energy Systems >Performance evaluation of K-means clustering algorithm with various distance metrics
【24h】

Performance evaluation of K-means clustering algorithm with various distance metrics

机译:各种距离度量的K-means聚类算法的性能评估

获取原文
获取原文并翻译 | 示例

摘要

Data Mining is the technique used to visualize and scrutinize the data and drive some useful information from that data so that information can be used to perform any useful work. So clustering is the one of the technique that has been proposed to be used in the area of data mining The notion behind clustering is to assigning objects to cluster based upon some customary characteristics such that object belonging to one cluster are similar other than those belonging to other clusters. There are numerous clustering algorithms available but K-means clustering is widely used to form clusters of colossal dataset. The footprint factor for k-means clustering is its scalability, efficiency, simplicity. This proposed paper aims to study the k-means clustering and various distance function used in k-means clustering such as Euclidean distance function and Manhattan distance function. Experiment and results are shown to observe the effect of these distance function upon k-means clustering. The distance functions are compared using number of iterations, within sum squared errors and time taken to build the full model.
机译:数据挖掘是一种用于可视化和检查数据并从该数据中驱动一些有用信息的技术,以便可以将信息用于执行任何有用的工作。因此,聚类是已被提议用于数据挖掘领域的技术之一。聚类的概念是根据一些习惯特征将对象分配给聚类,从而使属于一个聚类的对象与属于一个聚类的对象相似。其他集群。有许多可用的聚类算法,但K均值聚类被广泛用于形成巨大数据集的聚类。 k均值聚类的足迹因素是其可扩展性,效率,简单性。本文旨在研究k-means聚类和k-means聚类中使用的各种距离函数,例如欧氏距离函数和Manhattan距离函数。实验和结果表明可以观察到这些距离函数对k均值聚类的影响。使用迭代次数,总和误差和构建完整模型所需的时间来比较距离函数。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号