首页> 外文会议>International Conference on Computer Science and Information Technology >Robust Local Triangular Kernel density-based clustering for high-dimensional data
【24h】

Robust Local Triangular Kernel density-based clustering for high-dimensional data

机译:基于稳健的局部三角核密度的高维数据聚类

获取原文

摘要

A number of clustering algorithms can be employed to find clusters in multivariate data. However, the effectiveness and efficiency of the existing algorithms are limited, since the respective data has high dimension, contain large amount of noise and consist of clusters with arbitrary shapes and densities. In this paper, a new kernel density-based clustering algorithm, called Local Triangular Kernel-based Clustering (LTKC), is proposed to deal with these conditions. LTKC is based on combination of k-nearest-neighbor density estimation and triangular kernel density-based clustering. The advantages of our LTKC approach are: (1) it has a firm mathematical basis; (2) it requires only one parameter, number of neighbors; (3) it defines the number of cluster automatically; (4) it allows discovering clusters with arbitrary shapes and densities ;and (5) it is significantly faster than existing algorithms. LTKC is tested using artificial data and applied to some UCI data. A comparison with k-means, KFCM and well known density-based clustering algorithms including ILGC, DBSCAN, and DENCLUE shows the superiority of our proposed LTKC algorithm.
机译:可以采用多种聚类算法来找到多元数据中的聚类。然而,由于各个数据具有高维,包含大量噪声并且由具有任意形状和密度的簇组成,因此现有算法的有效性和效率受到限制。在本文中,提出了一种新的基于核密度的聚类算法,称为局部三角核聚类(LTKC),以应对这些情况。 LTKC是基于k近邻密度估计和基于三角核密度的聚类的组合。我们的LTKC方法的优点是:(1)它具有牢固的数学基础; (2)仅需要一个参数,即邻居数; (3)自动定义集群数; (4)它允许发现具有任意形状和密度的聚类;(5)它比现有算法快得多。使用人造数据对LTKC进行测试,并将其应用于某些UCI数据。与k均值,KFCM和众所周知的基于密度的聚类算法(包括ILGC,DBSCAN和DENCLUE)进行比较,显示了我们提出的LTKC算法的优越性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号