首页> 外文会议>IEEE International Conference on Fuzzy Systems >Linear Fuzzy Clustering of Mixed Databases Based on Cluster-wise Optimal Scaling of Categorical Variables
【24h】

Linear Fuzzy Clustering of Mixed Databases Based on Cluster-wise Optimal Scaling of Categorical Variables

机译:基于聚类的混合数据库的线性模糊聚类基于集群 - 明智的分类变量的最佳缩放

获取原文

摘要

This paper proposes a new approach to linear fuzzy clustering of mixed databases, in which categorical variables are quantified in each cluster based on optimal scaling. The objective function of the Fuzzy c-Varieties (FCV) clustering is defined using least squares criterion, and local principal component analysis (local PCA) is then performed in each cluster considering quantified scores of categorical variables. The new approach quantifies categorical variables in each cluster so that they suit the local linear model of the cluster. So, this is the second approach to optimal scaling in linear fuzzy clustering and contrasts to the global approach where categorical variables are quantified so that they suit for constructing a single numerical data space. The clustering algorithm is an enhanced FCV algorithm that includes an additional step for quantifying categorical variables in each cluster, and is useful for revealing cluster-wise mutual dependencies among numerical and nominal variables rather than for revealing geometrical relationships among data samples.
机译:本文提出了一种新的混合数据库线性模糊聚类方法,其中基于最佳缩放,在每个集群中量化分类变量。模糊C-品种(FCV)聚类的目标函数使用最小二乘标准定义,然后考虑定量分类变量的定量分数,在每个集群中执行局部主成分分析(本地PCA)。新方法量化每个群集中的分类变量,以便它们适合集群的本地线性模型。因此,这是在线性模糊聚类中最佳缩放的第二种方法,并与全局方法对比,其中定量分类变量,使得它们适合构造单个数值数据空间。聚类算法是增强型FCV算法,其包括用于量化每个群集中的分类变量的附加步骤,并且可用于在数值和标称变量之间揭示聚类相互依赖性,而不是为了揭示数据样本之间的几何关系。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号