...
首页> 外文期刊>Pattern Recognition: The Journal of the Pattern Recognition Society >An indication of unification for different clustering approaches
【24h】

An indication of unification for different clustering approaches

机译:不同集群方法统一的标志

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

The question of finding generic concepts and properties common to the different clustering approaches is a current problem. This inquire is addressed most thoroughly in Kleinberg's paper on the Impossibility Theorem (see [1]). Kleinberg introduced the notion of clustering function-a function that takes a dissimilarity measure defined on a data set S and returns a partition of S; and a set of simple properties for the study of such functions-Scale Invariance, Richness and Consistency. The main result of [1] is the Impossibility Theorem: there is no clustering method satisfying all these properties. This study has been accepted as a rigorous proof of the difficulty in finding a unified framework for different clustering approaches. Our goal in this paper is to provide primary concepts and results for the formal study of the various clustering approaches. To accomplish this, we discuss and expand on the ideas introduced by Kleinberg. Our guiding philosophy is to incorporate a crucial fact overlooked in the study conducted in [1]-clustering methods not only depend on the dissimilarity measure but also on other parameters such as dissimilarity thresholds, centroids, stop criteria, among others. This paper gives a formal definition of clustering method and reformulates the afore-mentioned properties, even it introduces some new. Contrary to the result obtained in [1], many of the methods discussed here satisfy all of our properties. With all these grounds in hand we glimpse a clue of unification among the different clustering approaches.
机译:寻找不同聚类方法共有的通用概念和属性的问题是当前的问题。在Kleinberg关于不可能定理的论文中最彻底地解决了这个问题(见[1])。克莱因伯格(Kleinberg)引入了聚类函数的概念-一种对数据集S进行定义并采取相异度量并返回S的分区的函数。以及一组用于研究此类函数的简单属性-尺度不变性,丰富性和一致性。 [1]的主要结果是不可能定理:没有满足所有这些特性的聚类方法。这项研究已被接受为证明为不同聚类方法找到统一框架的困难的严格证明。本文的目的是为各种聚类方法的正式研究提供主要概念和结果。为此,我们讨论并扩展了Kleinberg提出的想法。我们的指导思想是纳入在[1]聚类方法研究中被忽略的一个关键事实,它不仅取决于相异性度量,而且还取决于诸如相异性阈值,质心,停止标准等其他参数。本文给出了聚类方法的正式定义,并重新介绍了上述特性,甚至引入了一些新特性。与[1]中获得的结果相反,这里讨论的许多方法都满足我们的所有特性。有了所有这些基础,我们就可以了解不同聚类方法之间的统一线索。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号