...
首页> 外文期刊>Pattern Recognition: The Journal of the Pattern Recognition Society >A novel approach to fuzzy clustering based on a dissimilarity relation extracted from data using a TS system
【24h】

A novel approach to fuzzy clustering based on a dissimilarity relation extracted from data using a TS system

机译:一种基于TS系统从数据中提取的不相似关系的模糊聚类新方法

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Clustering refers to the process of unsupervised partitioning of a data set based on a dissimilarity measure, which determines the cluster shape. Considering that cluster shapes may change from one cluster to another, it would be of the utmost importance to extract the dissimilarity measure directly from the data by means of a data model. On the other hand, a model construction requires some kind of supervision of the data structure, which is exactly what we look for during clustering. So, the lower the supervision degree used to build the data model, the more it makes sense to resort to a data model for clustering purposes. Conscious of this, we propose to exploit very few pairs of patterns with known dissimilarity to build a TS system which models the dissimilarity relation. Among other things, the rules of the TS system provide an intuitive description of the dissimilarity relation itself. Then we use the TS system to build a dissimilarity matrix which is fed as input to an unsupervised fuzzy relational clustering algorithm, denoted any relation clustering algorithm (ARCA), which partitions the data set based on the proximity of the vectors containing the dissimilarity values between each pattern and all the other patterns in the data set. We show that combining the TS system and the ARCA algorithm allows us to achieve high classification performance on a synthetic data set and on two real data sets. Further, we discuss how the rules of the TS system represent a sort of linguistic description of the dissimilarity relation. (c) 2006 Pattern Recognition Society. Published by Elsevier Ltd. All rights reserved.
机译:聚类是指基于相异性度量对数据集进行无监督分区的过程,该过程确定了聚类的形状。考虑到群集形状可能会从一个群集更改为另一个群集,因此最重要的是直接借助数据模型从数据中提取差异性度量。另一方面,模型构建需要某种形式的数据结构监督,这正是我们在聚类过程中所寻找的。因此,用于构建数据模型的监督程度越低,出于集群目的而诉诸数据模型就越有意义。意识到这一点,我们建议利用极少数具有已知相似性的模式来构建对相似性关系进行建模的TS系统。 TS系统的规则尤其提供了不相似关系本身的直观描述。然后,我们使用TS系统构建一个相异度矩阵,该相异度矩阵作为输入被输入到无监督的模糊关系聚类算法(表示为任何关系聚类算法(ARCA)),该算法根据包含相似度值之间的向量的接近度对数据集进行分区数据集中的每个模式和所有其他模式。我们表明,结合TS系统和ARCA算法,可以使我们在合成数据集和两个真实数据集上实现较高的分类性能。此外,我们讨论了TS系统的规则如何表示一种对相似关系的语言描述。 (c)2006模式识别学会。由Elsevier Ltd.出版。保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号