首页> 外文期刊>Journal of Advanced Computatioanl Intelligence and Intelligent Informatics >An Indiscernibility-Based Clustering Method with Iterative Refinement of Equivalence Relations - Rough Clustering
【24h】

An Indiscernibility-Based Clustering Method with Iterative Refinement of Equivalence Relations - Rough Clustering

机译:一种基于不可辨度的迭代精化等价关系的聚类方法-粗聚类

获取原文
获取原文并翻译 | 示例
           

摘要

This paper presents a new indiscernibility-based clustering method called rough clustering, that can handle relative proximity. Relative proximity is a class of proximity measures that can be used to represent subjective similarity or dissimilarity; such as human judgment about likeness of persons. Since relative proximity is not necessarily required to satisfy the triangular inequality, conventional centroid-based clustering methods may fail to produce good clusters due to the inappropriate assignment of cluster representatives. Our method is based on iterative refinement of N binary classifications, where N denotes the number of objects. First, an equivalence relation, that classifies all the other objects into two classes, is assign for each of N objects, an equivalence relation that classifies all the other objects into two classes, similar and dissimilar is assigned by referring to their relative proximity. Next, for each pair of the objects, we count the number of binary classifications in which the pair is included in the same class. We call this number as indiscernibility degree. If the indiscernibility degree of a pair is larger than a user-defined threshold value, we modify the equivalence relations so that all of them commonly classify the pair into the same class. This process is repeated until class assignment becomes stable. Consequently, we obtain the clustering result that follows given level of granularity without using geometric measures.
机译:本文提出了一种新的基于不可区分性的聚类方法,称为粗糙聚类,可以处理相对接近度。相对接近度是一类可以用来表示主观相似性或相异性的接近度度量;例如人类对人物相似度的判断。由于不一定需要相对接近度来满足三角形不等式,因此传统的基于质心的聚类方法可能会由于聚类代表的分配不当而无法生成良好的聚类。我们的方法基于N个二元分类的迭代细化,其中N表示对象的数量。首先,为N个对象中的每一个分配一个将所有其他对象分为两类的等价关系,通过引用它们的相对接近度,将所有其他对象分为两个类的等价关系被分配。接下来,对于每对对象,我们计算该对象对包含在同一类中的二进制分类的数量。我们称这个数字为不可分辨度。如果一对的不可分辨度大于用户定义的阈值,则我们修改等价关系,以使它们共同将该对归为同一类。重复此过程,直到类分配变得稳定为止。因此,我们无需使用几何度量即可获得遵循给定粒度级别的聚类结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号