首页> 外文期刊>Expert systems with applications >initKmix-A novel initial partition generation algorithm for clustering mixed data using k-means-based clustering
【24h】

initKmix-A novel initial partition generation algorithm for clustering mixed data using k-means-based clustering

机译:initkmix-一种新颖的初始分区生成算法,用于使用基于k均值的群集聚类混合数据

获取原文
获取原文并翻译 | 示例

摘要

Mixed datasets consist of both numeric and categorical attributes. Various k-means-based clustering algorithms have been developed for these datasets. Generally, these algorithms use random partition as a starting point, which tends to produce different clustering results for different runs. In this paper, we propose, initKmix, a novel algorithm for finding an initial partition for k-means-based clustering algorithms for mixed datasets. In the initKmix algorithm, a k-means-based clustering algorithm is run many times, and in each run, one of the attributes is used to create initial clusters for that run. The clustering results of various runs are combined to produce the initial partition. This initial partition is then used as a seed to a k-means-based clustering algorithm to cluster mixed data. Experiments with various categorical and mixed datasets showed that initKmix produced accurate and consistent results, and outperformed the random initial partition method and other state-of-the-art initialization methods. Experiments also showed that k-means-based clustering for mixed datasets with initKmix performed similar to or better than many state-of-the-art clustering algorithms for categorical and mixed datasets.
机译:混合数据集包括数字和分类属性。已经为这些数据集开发了各种基于K-means的聚类算法。通常,这些算法使用随机分区作为起点,这倾向于为不同的运行产生不同的聚类结果。在本文中,我们提出了initkmix,一种用于查找混合数据集的基于k-means的聚类算法的初始分区的新颖算法。在initkmix算法中,基于k均值的群集算法很多次运行,并且在每个运行中,其中一个属性用于为该运行创建初始群集。组合各种运行的聚类结果以产生初始分区。然后将该初始分区用作基于K-means的聚类算法的种子以群集混合数据。具有各种分类和混合数据集的实验表明,initkmix产生了准确和一致的结果,并且优于随机初始分区方法和其他最先进的初始化方法。实验还表明,与initkmix的混合数据集的基于k-means的聚类类似于用于分类和混合数据集的许多最先进的聚类算法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号