首页> 外文期刊>Procedia Computer Science >Improvements for Determining the Number of Clusters in k-Means for Innovation Databases in SMEs
【24h】

Improvements for Determining the Number of Clusters in k-Means for Innovation Databases in SMEs

机译:确定中小企业创新数据库的k均值聚类数的改进

获取原文
           

摘要

The Automatic Clustering using Differential Evolution (ACDE) is one of the grouping methods capable of automatically determining the number of the cluster. However, ACDE continues making use of the strategy manual to determine the activation threshold of k, which affects its performance. In this study, the problem of ACDE is enhanced using the U Control Chart (UCC). The performance of the proposed method was tested using five data sets from the National Administrative Department of Statistics (DANE - Departamento Administrativo Nacional de Estadísticas) and the Ministry of Commerce, Industry, and Tourism of Colombia for the innovative capacity of Small and Medium-sized Enterprises (SMEs) and were assessed by the Davies Bouldin Index (DBI) and the Cosine Similarity (CS) measure. The results show that the proposed method yields excellent performance compared to prior researches for most datasets with optimal cluster number yet lowest DBI and CS measure. It can be concluded that the UCC method is able to determine k activation threshold in ACDE that caused effective determination of the cluster number for k-means clustering.
机译:使用差分进化自动聚类(ACDE)是能够自动确定聚类数的分组方法之一。但是,ACDE继续使用策略手册来确定k的激活阈值,这会影响k的性能。在这项研究中,使用U控制图(UCC)增强了ACDE问题。使用国家统计局(DANE-国民经济管理部)和哥伦比亚商业,工业及旅游部的五个数据集对中小型企业的创新能力进行了测试,验证了该方法的性能。企业(SME)并通过Davies Bouldin指数(DBI)和余弦相似度(CS)度量进行了评估。结果表明,与以往的研究相比,对于大多数具有最佳聚类数但最低DBI和CS度量的数据集,该方法具有出色的性能。可以得出结论,UCC方法能够确定ACDE中的k激活阈值,从而有效确定k均值聚类的簇数。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号