Improved K-Means Algorithm on Home Industry Data Clustering in the Province of Bangka Belitung

Abstract

The Government of Bangka Belitung Islands Province has not yet classified its home industries. To address this, we propose a k-means algorithm for clustering home industry data. The k-means algorithm is widely used because it is straightforward and well suited to grouping data. However, it has a weakness: the initial cluster centers are chosen at random, so a poor random initialization of the centroids leads to suboptimal grouping. Internal cluster validation is one way to determine the optimal number of clusters without prior information about the data. This study aims to identify the optimal grouping by improving the k-means algorithm and then testing it with two internal cluster validation indices, the Davies-Bouldin Index (DBI) and the Silhouette Index (SI), on home industry data from Bangka Belitung Islands Province. With the improved k-means algorithm, both the Silhouette Index and the DBI indicate an optimal cluster count of k = 3, whereas with the traditional k-means algorithm both indices indicate k = 2. We conclude that k = 3, as indicated by the Davies-Bouldin Index, gives good results for clustering home industry data in Bangka Belitung Islands Province.
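The abstract does not say how the initialization was improved, so the following is only a minimal sketch of the general workflow it describes: seed k-means deterministically, then score each candidate k with the Davies-Bouldin Index (lower is better) and the Silhouette Index (higher is better). The farthest-point seeding used here is a stand-in assumption, not necessarily the authors' method; the function names and the toy data are hypothetical and are not the home industry data. The sketch also assumes every cluster stays non-empty and no two centroids coincide.

```python
import math

def farthest_point_init(points, k):
    """Deterministic seeding (an assumption): start at the first point, then
    repeatedly add the point farthest from its nearest chosen centroid."""
    centroids = [points[0]]
    while len(centroids) < k:
        centroids.append(
            max(points, key=lambda p: min(math.dist(p, c) for c in centroids)))
    return centroids

def assign(points, centroids):
    """Label each point with the index of its nearest centroid."""
    return [min(range(len(centroids)), key=lambda j: math.dist(p, centroids[j]))
            for p in points]

def kmeans(points, k, max_iter=100):
    """Lloyd's iterations starting from the deterministic seeds above."""
    centroids = farthest_point_init(points, k)
    for _ in range(max_iter):
        labels = assign(points, centroids)
        updated = []
        for j in range(k):
            members = [p for p, l in zip(points, labels) if l == j]
            # Keep the old centroid if a cluster loses all its members.
            updated.append(tuple(sum(x) / len(members) for x in zip(*members))
                           if members else centroids[j])
        if updated == centroids:
            break
        centroids = updated
    return assign(points, centroids), centroids

def davies_bouldin(points, labels, centroids):
    """Mean over clusters i of max_j (S_i + S_j) / d(c_i, c_j), where S is the
    mean distance of a cluster's members to its centroid. Lower is better."""
    k = len(centroids)
    scatter = []
    for j in range(k):
        members = [p for p, l in zip(points, labels) if l == j]
        scatter.append(sum(math.dist(p, centroids[j]) for p in members) / len(members))
    return sum(
        max((scatter[i] + scatter[j]) / math.dist(centroids[i], centroids[j])
            for j in range(k) if j != i)
        for i in range(k)) / k

def silhouette(points, labels):
    """Mean silhouette width; higher is better. Singleton clusters score 0."""
    clusters = {}
    for p, l in zip(points, labels):
        clusters.setdefault(l, []).append(p)
    total = 0.0
    for p, l in zip(points, labels):
        own = clusters[l]
        if len(own) == 1:
            continue
        a = sum(math.dist(p, q) for q in own) / (len(own) - 1)
        b = min(sum(math.dist(p, q) for q in other) / len(other)
                for m, other in clusters.items() if m != l)
        total += (b - a) / max(a, b)
    return total / len(points)

# Toy data (hypothetical): three well-separated groups of four points each.
points = [(0, 0), (0, 1), (1, 0), (1, 1),
          (10, 0), (10, 1), (11, 0), (11, 1),
          (5, 10), (5, 11), (6, 10), (6, 11)]

scores = {}
for k in range(2, 6):
    labels, centroids = kmeans(points, k)
    scores[k] = (davies_bouldin(points, labels, centroids), silhouette(points, labels))

best_by_dbi = min(scores, key=lambda k: scores[k][0])
best_by_si = max(scores, key=lambda k: scores[k][1])
print(best_by_dbi, best_by_si)
```

On this toy data both indices select k = 3, which matches the three groups built into it. Deterministic seeding removes the run-to-run variation of random starts, but it does not guarantee the globally best partition on real data.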


Bibliographic details

Source: International Conference on Smart Technology and Applications, 2020, 1 v., 6 pages
Authors: Hadi Santoso; Hilyah Magdalena
Original format: PDF
Chinese Library Classification: Communications
Keywords: k-means algorithm; data clustering; cluster validation index; home industry
Indexed: 2022-08-21 06:03:27

Related literature

1. The Role of Government Assistance to Generate Competitive Leadership, Commitment, Motivation, Innovation, Environment and its Impact on the Performance of TenunCual Union Industry Cluster in Bangka Belitung Province [J]. Rudy Aryanto, Maria Fransiska. Procedia - Social and Behavioral Sciences. 2012, no. 5.
2. Clustering of User Behaviour based on Web Log data using Improved K-Means Clustering Algorithm [J]. S. Padmaja, Dr. Ananthi Sheshasaayee. International Journal of Engineering and Technology. 2016, no. 1.
3. An initial seed selection algorithm for k-means clustering of georeferenced data to improve replicability of cluster assignments for mapping application [J]. Fouad Khan. Applied Soft Computing. 2012, no. 11.
4. Improved K-Means Algorithm on Home Industry Data Clustering in the Province of Bangka Belitung [C]. Hadi Santoso, Hilyah Magdalena. International Conference on Smart Technology and Applications. 2020.
5. Clustering educational digital library usage data: Comparisons of latent class analysis and K-means algorithms [D]. Xu, Beijie. 2011.
6. Does Determination of Initial Cluster Centroids Improve the Performance of K-Means Clustering Algorithm? Comparison of Three Hybrid Methods by Genetic Algorithm, Minimum Spanning Tree and Hierarchical Clustering in an Applied Study [O]. Saeedeh Pourahmad, Atefeh Basirat, Amir Rahimi. 2020.
7. K-Means Algorithm For Clustering Poverty Data in Bangka Belitung Island Province [O]. Castaka Agus Sugianto, Tri Pratiwi Olivia Riska Bokings. 2021.
