K and starting means for k-means algorithm

Fahim Ahmed

首页> 外文期刊>Journal of computational science >K and starting means for k-means algorithm

【24h】

K and starting means for k-means algorithm

机译：K and starting means for k-means algorithm

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相关主题

摘要

The k-means method aims to divide a set of N objects into k clusters, where each cluster is represented by the mean value of its objects. This algorithm is simple and converges to local minima quickly. It has linear time complexity, but it requires the number of clusters in advance which requires some knowledge in advance, in addition to selecting the initial centers which affect the quality of the final result and the number of iterations. The quality of the final result and the number of iterations depend on both k and initial centers. Many papers tried to detect a suitable value for k (the number of clusters) or introduced a better method for selecting the initial centers only. This research introduces a method able to detect a near-optimal value for k and near-optimal initial centers. The proposed method adds a preprocessing step to get the number of clusters and the initial centers before applying the k-means method. The idea is to get initial clusters using a density-based method that does not require the number of clusters in advance and computes the mean values for objects in each cluster and uses this knowledge in k-means. This leads to improving the quality of the final result as presented in the experimental results. The proposed method will use the DBSCAN "Density-based spatial clustering of application with noise" method as a preprocessing step. So, the paper concentrates on the DBSCAN and k-means. The proposed method will converge to global minima which improve the quality of the final result. The proposed method requires the two input parameters for the DBSCAN method and its time complexity is o(n log n) which is the same as that of DBSCAN.

著录项

来源
《Journal of computational science》 |2021年第10期|101445.1-101445.15|共15页
作者
Fahim Ahmed;
展开▼
作者单位

Prince Sattam Bin Abdulaziz Univ, Fac Sci & Humanities Studies, Aflaj, Saudi Arabia|Suez Univ, Fac Comp & Informat, Suez, Egypt;

展开▼
收录信息
原文格式 PDF
正文语种英语
中图分类
关键词
Data clustering; k in k-means; Starting means in k-means;

K and starting means for k-means algorithm

摘要

著录项

相关主题

期刊订阅