Performance evaluation of K-means clustering algorithm with various distance metrics

机译：各种距离度量的K-means聚类算法的性能评估

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Data Mining is the technique used to visualize and scrutinize the data and drive some useful information from that data so that information can be used to perform any useful work. So clustering is the one of the technique that has been proposed to be used in the area of data mining The notion behind clustering is to assigning objects to cluster based upon some customary characteristics such that object belonging to one cluster are similar other than those belonging to other clusters. There are numerous clustering algorithms available but K-means clustering is widely used to form clusters of colossal dataset. The footprint factor for k-means clustering is its scalability, efficiency, simplicity. This proposed paper aims to study the k-means clustering and various distance function used in k-means clustering such as Euclidean distance function and Manhattan distance function. Experiment and results are shown to observe the effect of these distance function upon k-means clustering. The distance functions are compared using number of iterations, within sum squared errors and time taken to build the full model.

机译：数据挖掘是一种用于可视化和检查数据并从该数据中驱动一些有用信息的技术，以便可以将信息用于执行任何有用的工作。因此，聚类是已被提议用于数据挖掘领域的技术之一。聚类的概念是根据一些习惯特征将对象分配给聚类，从而使属于一个聚类的对象与属于一个聚类的对象相似。其他集群。有许多可用的聚类算法，但K均值聚类被广泛用于形成巨大数据集的聚类。 k均值聚类的足迹因素是其可扩展性，效率，简单性。本文旨在研究k-means聚类和k-means聚类中使用的各种距离函数，例如欧氏距离函数和Manhattan距离函数。实验和结果表明可以观察到这些距离函数对k均值聚类的影响。使用迭代次数，总和误差和构建完整模型所需的时间来比较距离函数。

著录项

来源
《1st IEEE International Conference on Power Electronics, Intelligent Control and Energy Systems》|2016年|1-4|共4页
会议地点 Delhi(IN)
作者
Shruti Kapil; Meenu Chawla;
展开▼
作者单位

Computer Science Engineering Department, GZSCCET, Bathinda, India;

Computer Science Engineering Department, GZSCCET, Bathinda, India;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类
关键词
Data mining; Euclidean distance; Clustering algorithms; Algorithm design and analysis; Conferences; Power electronics;

机译：数据挖掘;欧式距离;聚类算法;算法设计与分析;会议;电力电子;

相似文献

外文文献
中文文献
专利

1. Improving performance of classification on severity of ill effects (SEV) index on fish using K-Means clustering algorithm with various distance metrics [J] . Khakzad Hamid Water Practice and Technology . 2019,第1期

机译：使用具有各种距离指标的K-Means聚类算法提高对鱼类的病害严重性（SEV）指数的分类性能
2. Improving performance of classification on severity of ill effects (SEV) index on fish using K-Means clustering algorithm with various distance metrics [J] . Khakzad Hamid Water Practice and Technology . 2018,第4期

机译：使用具有各种距离指标的K-Means聚类算法提高对鱼类的病害严重性（SEV）指数的分类性能
3. k-Means clustering with a new divergence-based distance metric: Convergence and performance analysis [J] . Chakraborty Saptarshi, Das Swagatam Pattern recognition letters . 2017,第DECa1期

机译：具有新的基于散度的距离度量的k-Means聚类：收敛和性能分析
4. Performance Evaluation of K-means Clustering Algorithm with Various Distance Metrics [C] . Shruti Kapil, Meenu Chawla IEEE International Conference on Power Electronics, Intelligent Control and Energy Systems . 2016

机译：具有各种距离指标的K-Means聚类算法性能评估
5. Hardware Implementation and Performance Evaluation of K-Means and K-Means++ Clustering Algorithms [D] . Singh, Manisha . 2019

机译：K-Means和K-Means ++聚类算法的硬件实现和性能评估
6. Does Determination of Initial Cluster Centroids Improve the Performance of K-Means Clustering Algorithm? Comparison of Three Hybrid Methods by Genetic Algorithm Minimum Spanning Tree and Hierarchical Clustering in an Applied Study [O] . Saeedeh Pourahmad, Atefeh Basirat, Amir Rahimi, 2020

机译：初始簇质心的确定是否提高了K-Means聚类算法的性能？应用研究中遗传算法最小生成树和分层聚类的三种混合方法的比较
7. Performance Evaluation of K-means Clustering Algorithm with Various Distance Metrics [O] . Y. S. Thakare, S. B. Bagal 2015

机译：不同距离度量K-means聚类算法的性能评估
8. K-Means Re-Clustering Algorithmic Options with Quantifiable Performance Comparisons [R] . Meyer, A. W., Paglieroni, D. W., Astaneh, C. 2002

机译：K-means通过可量化的性能比较重新聚类算法选项

Performance evaluation of K-means clustering algorithm with various distance metrics

摘要

著录项

相似文献

相关主题

期刊订阅