首页> 外文会议>International Conference on Computational Intelligence and Security >A New Weight Based Density Peaks Clustering Algorithm for Numerical and Categorical Data
【24h】

A New Weight Based Density Peaks Clustering Algorithm for Numerical and Categorical Data

机译:一种新的基于重量的数字和分类数据的密度峰集聚类算法

获取原文

摘要

Discovering the potential group structure of objects is of crucial importance to data mining. Most of the existing clustering approaches are applicable only to purely numerical or categorical data, and only a few approaches can deal with both numerical and categorical attributes recently, however, these approaches often need higher computational cost. To cluster data with both numerical and categorical attributes efficiently, in this paper, we propose a new approach with the following schemes. First, a measure of the importance of each categorical attribute is designed and a method to generate the weight of each categorical attribute is proposed based on this measure. Then a unified distance metric is proposed by combining the distance for the numerical part and that for the categorical part with weights. Furthermore, combining the new weights into method in [1], an improved density peaks clustering algorithm is presented. Finally, the experimental results show the efficiency of the proposed approach.
机译:发现对象的潜在组结构对数据挖掘至关重要。大多数现有的聚类方法仅适用于纯粹数值或分类数据,并且只有几种方法可以达到最近的数值和分类属性,然而,这些方法通常需要更高的计算成本。在本文中有效地与数值和分类属性进行群集数据,我们提出了一种新方法,具有以下方案。首先,设计了每个分类属性的重要性的衡量标准,并基于该测量来提出生成每个分类属性权重的方法。然后,通过将数值部分的距离与重量的分类部分组合来提出统一距离度量。此外,将新的权重结合到[1]中的方法中,提出了一种改进的密度峰聚类算法。最后,实验结果表明了所提出的方法的效率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号