An Empirical Evaluation of K-Means Clustering Algorithm Using Different Distance/Similarity Metrics

机译：使用不同距离/相似度量的K-Means聚类算法的实证评估

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

k-means is an effective and efficient clustering algorithm. It uses distance/similarity metric to find out the distance/similarity among the data objects. The objects which are closer/similar to each other are assigned to the same cluster where as distant/dissimilar objects are assigned to different clusters. Most of the implementations of k-means are based on Euclidean/Squared Euclidean distance metrics. In order to find out the possibility of different distance/similarity metrics to be used with k-means algorithm, an empirical evaluation has been performed. In this paper, accuracy, performance and reliability of 13 different distance/similarity measures over 6 different variations of data using k-means algorithm have been compared based on empirical evaluation on well-known benchmark IRIS data set. Accuracy is measured in terms of similarity of cluster assignment between ground truth and machine clustering. Performance is measured in terms of the number of iterations used for convergence of the final cluster assignment. Reliability is measured on the basis of correctness of the cluster assignment.

机译：K-means是一种有效且有效的聚类算法。它使用距离/相似度度量来了解数据对象之间的距离/相似性。彼此靠近/相似的对象被分配给与遥控器/不同对象分配给不同群集的相同的集群。 K-Means的大多数实现基于欧几里德/平方欧几里德距离指标。为了找出要与K-Means算法一起使用的不同距离/相似度的可能性，已经执行了经验评估。本文基于众所周知的基准IRIS数据集的经验评估，比较了13个不同距离/相似性测量的精度，性能和可靠性，通过k-mean算法的实证评估进行了比较。在地面真理和机器聚类之间的集群分配的相似性方面测量精度。性能是以用于最终集群分配的收敛的迭代的数量来衡量。基于集群分配的正确性来测量可靠性。

著录项

来源
《International conference on emerging trends in information technology》|2020年|xxvii 1144 p.|共9页
会议地点
作者
Manoj Kumar Gupta; Pravin Chandra;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类计算技术、计算机技术;
关键词
Data mining; Clustering; K-means; Distance metrics; Similarity metrics;

机译：数据挖掘;聚类;k均值;距离指标;相似度指标;

相似文献

外文文献
中文文献
专利

1. Improving performance of classification on severity of ill effects (SEV) index on fish using K-Means clustering algorithm with various distance metrics [J] . Khakzad Hamid Water Practice and Technology . 2019,第1期

机译：使用具有各种距离指标的K-Means聚类算法提高对鱼类的病害严重性（SEV）指数的分类性能
2. Improving performance of classification on severity of ill effects (SEV) index on fish using K-Means clustering algorithm with various distance metrics [J] . Khakzad Hamid Water Practice and Technology . 2018,第4期

机译：使用具有各种距离指标的K-Means聚类算法提高对鱼类的病害严重性（SEV）指数的分类性能
3. Evaluation Of Fuzzy K-Means And K-Means Clustering Algorithms In Intrusion Detection Systems [J] . Farhad Soleimanian Gharehchopogh, Neda Jabbari, Zeinab Ghaffari Azar International Journal of Scientific & Technology Research . 2012,第11期

机译：入侵检测系统中模糊K-均值和K-均值聚类算法的评估
4. An Empirical Evaluation of K-Means Clustering Algorithm Using Different Distance/Similarity Metrics [C] . Manoj Kumar Gupta, Pravin Chandra International conference on emerging trends in information technology . 2020

机译：使用不同距离/相似度量的K-Means聚类算法的实证评估
5. Hardware Implementation and Performance Evaluation of K-Means and K-Means++ Clustering Algorithms [D] . Singh, Manisha . 2019

机译：K-Means和K-Means ++聚类算法的硬件实现和性能评估
6. Evaluating performance of health care facilities at meeting HIV-indicator reporting requirements in Kenya: an application of K-means clustering algorithm [O] . Milka Bochere Gesicho, Martin Chieng Were, Ankica Babic 2021

机译：在肯尼亚达到艾滋病病毒指标报告要求时评估医疗设施的表现：K-Means聚类算法的应用
7. Performance Evaluation of K-means Clustering Algorithm with Various Distance Metrics [O] . Y. S. Thakare, S. B. Bagal 2015

机译：不同距离度量K-means聚类算法的性能评估

An Empirical Evaluation of K-Means Clustering Algorithm Using Different Distance/Similarity Metrics

摘要

著录项

相似文献

相关主题

期刊订阅