Hierarchical clustering algorithm for categorical data using a probabilistic rough set model

Abstract

Several cluster analysis techniques exist for dividing similar categorical objects into groups. Some can handle uncertainty in the clustering process, whereas others have stability issues. In this paper, we propose a new technique called TMDP (Total Mean Distribution Precision) for selecting the partitioning attribute based on probabilistic rough set theory. On the basis of this technique, with the concept of granularity, we derive a new clustering algorithm, MTMDP (Maximum Total Mean Distribution Precision), for categorical data. The MTMDP algorithm is a robust clustering algorithm that handles uncertainty in the process of clustering categorical data. We compare the MTMDP algorithm with the MMR (Min-Min-Roughness) algorithm, which is the most relevant clustering algorithm, and also compare it with other, unstable clustering algorithms, such as k-modes, fuzzy k-modes, and fuzzy centroids. The experimental results indicate that the MTMDP algorithm can be successfully used to analyze grouped categorical data because it produces better clustering results.
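
The abstract does not give the TMDP formula, so the following Python sketch does not implement TMDP or MTMDP. It only illustrates the standard rough set building blocks the abstract refers to: the granules (equivalence classes) induced by one categorical attribute, the classical Pawlak approximation accuracy, and a threshold-based probabilistic lower approximation. The function names, the toy data, and the threshold values are assumptions made for illustration.

```python
from collections import defaultdict


def equivalence_classes(rows, attr):
    """Group object indices by their value on one categorical attribute."""
    classes = defaultdict(set)
    for index, row in enumerate(rows):
        classes[row[attr]].add(index)
    return list(classes.values())


def approximation_accuracy(target, granules):
    """Classical Pawlak accuracy: |lower approximation| / |upper approximation|."""
    lower, upper = set(), set()
    for granule in granules:
        if granule <= target:   # granule lies entirely inside the target set
            lower |= granule
        if granule & target:    # granule overlaps the target set
            upper |= granule
    return len(lower) / len(upper) if upper else 1.0


def probabilistic_lower(target, granules, threshold):
    """Union of granules with P(target | granule) >= threshold.

    The threshold is an illustrative parameter, not a value from the paper.
    """
    result = set()
    for granule in granules:
        if len(granule & target) / len(granule) >= threshold:
            result |= granule
    return result


# Hypothetical toy data set with two categorical attributes.
rows = [
    {"color": "red", "size": "small"},
    {"color": "red", "size": "small"},
    {"color": "blue", "size": "small"},
    {"color": "blue", "size": "large"},
    {"color": "green", "size": "large"},
]

granules = equivalence_classes(rows, "color")
small = {i for i, row in enumerate(rows) if row["size"] == "small"}

accuracy = approximation_accuracy(small, granules)
strict_lower = probabilistic_lower(small, granules, threshold=0.75)
relaxed_lower = probabilistic_lower(small, granules, threshold=0.5)
```

In this toy example the "blue" granule is only half inside the target set, so it is excluded from the lower approximation at a threshold of 0.75 but included at 0.5. That tolerance for partial overlap is what distinguishes the probabilistic model from the classical one.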

Bibliographic information

  • Source
    Knowledge-Based Systems, 2014, Issue 7, pp. 60-71 (12 pages)
  • Author affiliations

    Nanchang Institute of Technology, Nanchang, Jiangxi 330099, PR China;

    Nanchang Institute of Technology, Nanchang, Jiangxi 330099, PR China, Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100080, PR China, Graduate School of Chinese Academy of Sciences, Beijing 100080, PR China;

    Nanchang Institute of Technology, Nanchang, Jiangxi 330099, PR China;

    Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, Guangdong 518055, PR China;

    Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, Guangdong 518055, PR China;

  • Original format: PDF
  • Language: English
  • Keywords

    Cluster analysis; Categorical data; Probabilistic rough sets; Distribution approximation precision; Approximation accuracy;
