首页> 美国卫生研究院文献>Sensors (Basel Switzerland) >A Clustering-Oriented Closeness Measure Based on Neighborhood Chain and Its Application in the Clustering Ensemble Framework Based on the Fusion of Different Closeness Measures
【2h】

A Clustering-Oriented Closeness Measure Based on Neighborhood Chain and Its Application in the Clustering Ensemble Framework Based on the Fusion of Different Closeness Measures

机译:基于邻域链的聚类贴近度度量及其在融合不同贴近度度量的聚类集成框架中的应用

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Closeness measures are crucial to clustering methods. In most traditional clustering methods, the closeness between data points or clusters is measured by the geometric distance alone. These metrics quantify the closeness only based on the concerned data points’ positions in the feature space, and they might cause problems when dealing with clustering tasks having arbitrary clusters shapes and different clusters densities. In this paper, we first propose a novel Closeness Measure between data points based on the Neighborhood Chain (CMNC). Instead of using geometric distances alone, CMNC measures the closeness between data points by quantifying the difficulty for one data point to reach another through a chain of neighbors. Furthermore, based on CMNC, we also propose a clustering ensemble framework that combines CMNC and geometric-distance-based closeness measures together in order to utilize both of their advantages. In this framework, the “bad data points” that are hard to cluster correctly are identified; then different closeness measures are applied to different types of data points to get the unified clustering results. With the fusion of different closeness measures, the framework can get not only better clustering results in complicated clustering tasks, but also higher efficiency.
机译:紧密度度量对于聚类方法至关重要。在大多数传统的聚类方法中,仅通过几何距离来测量数据点或聚类之间的紧密度。这些度量仅根据相关数据点在特征空间中的位置来量化紧密度,并且在处理具有任意聚类形状和不同聚类密度的聚类任务时可能会引起问题。在本文中,我们首先提出一种新的基于邻域链(CMNC)的数据点之间的紧密度度量。 CMNC并没有单独使用几何距离,而是通过量化一个数据点通过邻居链到达另一个数据点的难度来测量数据点之间的紧密度。此外,基于CMNC,我们还提出了一个聚类集成框架,将CMNC和基于几何距离的接近度度量结合在一起,以利用它们的两个优势。在此框架中,确定了难以正确聚类的“不良数据点”;然后对不同类型的数据点采用不同的接近度度量,以获得统一的聚类结果。通过融合不同的紧密度度量,该框架不仅可以在复杂的聚类任务中获得更好的聚类结果,而且可以获得更高的效率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号