首页> 外文会议>Australasian joint conference on artificial intelligence >Finding the k in K-means Clustering: A Comparative Analysis Approach
【24h】

Finding the k in K-means Clustering: A Comparative Analysis Approach

机译:在K均值聚类中找到k:一种比较分析方法

获取原文

摘要

This paper explores the application of inequality indices, a concept successfully applied in comparative software analysis among many application domains, to find the optimal value k for k-means when clustering road traffic data. We demonstrate that traditional methods for identifying the optimal value for k (such as gap statistic and Pham et al.'s method) are unable to produce meaningful values for k when applying them to a real-world dataset for road traffic. On the other hand, a method based on inequality indices shows significant promises in producing much more sensible values for the number k of clusters to be used in k-means clustering for the same road network traffic dataset.
机译:本文探讨了不等式指标的应用,它是在许多应用领域之间的比较软件分析中成功应用的概念,旨在为道路交通数据聚类时找到k均值的最优值k。我们证明了识别k最佳值的传统方法(例如,间隙统计和Pham等人的方法)在将其应用于现实世界的道路交通数据集时无法产生有意义的k值。另一方面,基于不平等指数的方法显示出巨大的希望,可为同一道路网络交通数据集的k均值聚类中使用的聚类数k产生更合理的值。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号