Enhancing minimum spanning tree-based clustering by removing density-based outliers

Wang X.; Wang X.L.; Chen C.; Wilkes D.M.

首页> 外文期刊>Digital Signal Processing >Enhancing minimum spanning tree-based clustering by removing density-based outliers

【24h】

Enhancing minimum spanning tree-based clustering by removing density-based outliers

机译：通过消除基于密度的异常值来增强基于最小生成树的聚类

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Traditional minimum spanning tree-based clustering algorithms only make use of information about edges contained in the tree to partition a data set. As a result, with limited information about the structure underlying a data set, these algorithms are vulnerable to outliers. To address this issue, this paper presents a simple while efficient MST-inspired clustering algorithm. It works by finding a local density factor for each data point during the construction of an MST and discarding outliers, i.e., those whose local density factor is larger than a threshold, to increase the separation between clusters. This algorithm is easy to implement, requiring an implementation of iDistance as the only k-nearest neighbor search structure. Experiments performed on both small low-dimensional data sets and large high-dimensional data sets demonstrate the efficacy of our method.

机译：传统的基于最小生成树的聚类算法仅利用有关树中包含的边缘的信息来对数据集进行分区。结果，由于有关数据集基础结构的信息有限，因此这些算法容易受到异常值的影响。为了解决这个问题，本文提出了一种简单而有效的MST启发式聚类算法。它通过在构造MST期间为每个数据点找到局部密度因子并丢弃离群值（即那些局部密度因子大于阈值的离群值）来增加聚类之间的间隔而工作。该算法易于实现，需要将iDistance实现为唯一的k最近邻搜索结构。在小型低维数据集和大型高维数据集上进行的实验证明了我们方法的有效性。

著录项

来源
《Digital Signal Processing》 |2013年第5期|共16页
作者
Wang X.; Wang X.L.; Chen C.; Wilkes D.M.;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类 73.41262;
关键词
Clustering; Density-based clustering algorithms; Density-based outliers; Indexing structures; Minimum spanning tree-based clustering algorithms;

机译：聚类;基于密度的聚类算法;基于密度的离群值;索引结构;基于最小生成树的聚类算法;

相似文献

外文文献
中文文献
专利

1. Enhancing minimum spanning tree-based clustering by removing density-based outliers [J] . Wang X., Wang X.L., Chen C., Digital Signal Processing . 2013,第5期

机译：通过消除基于密度的异常值来增强基于最小生成树的聚类
2. Density-based o-means clustering algorithm using minimum spanning tree [J] . Peter S.J. Journal of Discrete Mathematical Sciences and Cryptography . 2012,第4a5期

机译：基于最小生成树的基于密度的o均值聚类算法
3. Density-based o-means clustering algorithm using minimum spanning tree [J] . Peter S.J. Journal of Discrete Mathematical Sciences and Cryptography . 2012,第4a5期

机译：基于最小生成树的基于密度的o均值聚类算法
4. A New Fast Minimum Spanning Tree-Based Clustering Technique [C] . Xiaochun Wang, Wang Xia L., Jihua Zhu IEEE International Conference on Data Mining Workshops . 2014

机译：一种新的基于最小生成树的快速最小化聚类技术
5. A minimum spanning tree based clustering algorithm for high throughput biological data. [D] . Pirim, Harun. 2011

机译：用于高通量生物数据的基于最小生成树的聚类算法。
6. Brain Connectivity and Information-Flow Breakdown Revealed by a Minimum Spanning Tree-Based Analysis of MRI Data in Behavioral Variant Frontotemporal Dementia [O] . Valentina Saba, Enrico Premi, Viviana Cristillo, 2010

机译：行为变异额颞痴呆的MRI数据基于最小生成树的MRI分析揭示了大脑的连通性和信息流分解
7. Local Density-based Hierarchical Clustering for Overlapping Distribution using Minimum Spanning Tree [O] . S. JohnPeter 2012

机译：基于局部密度的分层聚类，用于使用最小生成树重叠分布

Enhancing minimum spanning tree-based clustering by removing density-based outliers

摘要

著录项

相似文献

相关主题

期刊订阅