首页> 外文会议>International Conference on Artificial Intelligence and Soft Computing;ICAISC 2014 >DenClust: A Density Based Seed Selection Approach for K-Means
【24h】

DenClust: A Density Based Seed Selection Approach for K-Means

机译:Denclust:K-Means的基于密度的种子选择方法

获取原文

摘要

In this paper we present a clustering technique called Den- Clust that produces high quality initial seeds through a deterministic process without requiring an user input on the number of clusters k and the radius of the clusters r. The high quality seeds are given input to K-Means as the set of initial seeds to produce the final clusters. DenClust uses a density based approach for initial seed selection. It calculates the density of each record, where the density of a record is the number of records that have the minimum distances with the record. This approach is expected to produce high quality initial seeds for K-Means resulting in high quality clusters from a dataset. The performance of DenClust is compared with five (5) existing techniques namely CRUDAW, AGCUK, Simple K-means (SK), Basic Farthest Point Heuristic (BFPH) and New Farthest Point Heuristic (NFPH) in terms of three (3) external cluster evaluation criteria namely F-Measure, Entropy, Purity and two (2) internal cluster evaluation criteria namely Xie-Beni Index (XB) and Sum of Square Error (SSE). We use three (3) natural datasets that we obtain from the UCI machine learning repository. DenClust performs better than all five existing techniques in terms of all five evaluation criteria for all three datasets used in this study.
机译:在本文中,我们介绍了一种称为Den-Clust的聚类技术,其通过确定性过程产生高质量的初始种子,而不需要对簇K的数量和簇R的半径输入的用户输入。高质量的种子被给予K-Meance作为初始种子的一组,以产生最终簇。 Denclust使用基于密度的初始种子选择方法。它计算每个记录的密度,其中记录的密度是记录的记录数量,其具有记录的最小距离。预计这种方法将为K-meails生产高质量的初始种子,从而产生来自数据集的高质量簇。将Denclust的性能与五(5)个现有技术进行比较,即Crudaw,Agcuk,简单的K-Meance(SK),基本最远的点启发式(BFPH)和新的最远点启发式(NFPH),其三(3)个外部群集评估标准即F测量,熵,纯度和两(2)内部群集评估标准即Xie-Beni指数(XB)和方误差(SSE)的总和。我们使用从UCI机器学习存储库获得的三(3)个自然数据集。根据本研究中使用的所有三个数据集的所有五个评估标准,Denclust在所有五个评估标准方面表现优于所有五种现有技术。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号