首页> 外文期刊>Fuzzy Systems, IEEE Transactions on >Comparing Fuzzy, Probabilistic, and Possibilistic Partitions
【24h】

Comparing Fuzzy, Probabilistic, and Possibilistic Partitions

机译:比较模糊,概率和可能性分区

获取原文
获取原文并翻译 | 示例
           

摘要

When clustering produces more than one candidate to partition a finite set of objects ${bf O}$, there are two approaches to validation (i.e., selection of a “best” partition, and implicitly, a best value for $c$ , which is the number of clusters in ${bf O}$). First, we may use an internal index, which evaluates each partition separately. Second, we may compare pairs of candidates with each other, or with a reference partition that purports to represent the “true” cluster structure in the objects. This paper generalizes many of the classical indices that have been used with outputs of crisp clustering algorithms so that they are applicable for candidate partitions of any type (i.e., crisp or soft, with soft comprising the fuzzy, probabilistic, and possibilistic cases). Space prevents inclusion of all of the possible generalizations that can be realized this way. Here, we concentrate on the Rand index and its modifications. We compare our fuzzy-Rand index with those of Campello, Hullermeier and Rifqi, and Brouwer, and show that our extension of the Rand index is $O$($n$), while the other three are all $O(n^{2})$. Numerical examples are given to illustrate various facets of the new indices. In particular, we show that our indices can be used, even when the partitions are probabilistic or possibilistic, and that our method of generalization is valid for any index that depends only on the entries of the classical (i.e., four-pair types) contingency table for this problem.
机译:当聚类产生多个候选者来划分一组有限的对象$ {bf O} $时,有两种验证方法(即,选择“最佳”分区,并隐式地为$ c $选择最佳值)是$ {bf O} $中的簇数)。首先,我们可以使用内部索引,该索引分别评估每个分区。其次,我们可以将候选对象对彼此进行比较,或者将其与旨在代表对象中“真实”群集结构的参考分区进行比较。本文归纳了许多用于脆性聚类算法输出的经典索引,以便它们适用于任何类型的候选分区(即脆性或软性,软性包括模糊,概率和可能的情况)。空间会阻止包含所有可以通过这种方式实现的概括。在这里,我们集中于兰德指数及其修改。我们将Campan,Hullermeier,Rifqi和Brouwer的模糊兰德指数进行了比较,结果表明,兰德指数的扩展名是$ O $($ n $),而其他三个都是$ O(n ^ { 2})$。数值例子说明了新指标的各个方面。特别是,我们表明即使分区是概率性的或可能性的,我们的索引也可以使用,并且我们的概括方法对于仅依赖于经典(即四对类型)偶发事件条目的任何索引均有效这个问题的表格。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号