首页> 外文期刊>Digital Signal Processing >Enhancing minimum spanning tree-based clustering by removing density-based outliers
【24h】

Enhancing minimum spanning tree-based clustering by removing density-based outliers

机译:通过消除基于密度的异常值来增强基于最小生成树的聚类

获取原文
获取原文并翻译 | 示例
           

摘要

Traditional minimum spanning tree-based clustering algorithms only make use of information about edges contained in the tree to partition a data set. As a result, with limited information about the structure underlying a data set, these algorithms are vulnerable to outliers. To address this issue, this paper presents a simple while efficient MST-inspired clustering algorithm. It works by finding a local density factor for each data point during the construction of an MST and discarding outliers, i.e., those whose local density factor is larger than a threshold, to increase the separation between clusters. This algorithm is easy to implement, requiring an implementation of iDistance as the only k-nearest neighbor search structure. Experiments performed on both small low-dimensional data sets and large high-dimensional data sets demonstrate the efficacy of our method.
机译:传统的基于最小生成树的聚类算法仅利用有关树中包含的边缘的信息来对数据集进行分区。结果,由于有关数据集基础结构的信息有限,因此这些算法容易受到异常值的影响。为了解决这个问题,本文提出了一种简单而有效的MST启发式聚类算法。它通过在构造MST期间为每个数据点找到局部密度因子并丢弃离群值(即那些局部密度因子大于阈值的离群值)来增加聚类之间的间隔而工作。该算法易于实现,需要将iDistance实现为唯一的k最近邻搜索结构。在小型低维数据集和大型高维数据集上进行的实验证明了我们方法的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号