Linear Fuzzy Clustering of Mixed Databases Based on Cluster-wise Optimal Scaling of Categorical Variables

机译：基于聚类的混合数据库的线性模糊聚类基于集群 - 明智的分类变量的最佳缩放

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper proposes a new approach to linear fuzzy clustering of mixed databases, in which categorical variables are quantified in each cluster based on optimal scaling. The objective function of the Fuzzy c-Varieties (FCV) clustering is defined using least squares criterion, and local principal component analysis (local PCA) is then performed in each cluster considering quantified scores of categorical variables. The new approach quantifies categorical variables in each cluster so that they suit the local linear model of the cluster. So, this is the second approach to optimal scaling in linear fuzzy clustering and contrasts to the global approach where categorical variables are quantified so that they suit for constructing a single numerical data space. The clustering algorithm is an enhanced FCV algorithm that includes an additional step for quantifying categorical variables in each cluster, and is useful for revealing cluster-wise mutual dependencies among numerical and nominal variables rather than for revealing geometrical relationships among data samples.

机译：本文提出了一种新的混合数据库线性模糊聚类方法，其中基于最佳缩放，在每个集群中量化分类变量。模糊C-品种（FCV）聚类的目标函数使用最小二乘标准定义，然后考虑定量分类变量的定量分数，在每个集群中执行局部主成分分析（本地PCA）。新方法量化每个群集中的分类变量，以便它们适合集群的本地线性模型。因此，这是在线性模糊聚类中最佳缩放的第二种方法，并与全局方法对比，其中定量分类变量，使得它们适合构造单个数值数据空间。聚类算法是增强型FCV算法，其包括用于量化每个群集中的分类变量的附加步骤，并且可用于在数值和标称变量之间揭示聚类相互依赖性，而不是为了揭示数据样本之间的几何关系。

著录项

来源
《IEEE International Conference on Fuzzy Systems》|2007年||共6页
会议地点
作者
Honda Katsuhiro; Uesugi Ryo; Ichihashi Hidetomo; Notsu Akira; FUZZY;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP273.4-53;
关键词

相似文献

外文文献
中文文献
专利

1. FCM-Type Fuzzy Clustering of Mixed Databases Considering Nominal Variable Quantification [J] . Katsuhiro Honda, Ryo Uesugi, Hidetomo Ichihashi Journal of Advanced Computatioanl Intelligence and Intelligent Informatics . 2007,第2期

机译：考虑名义变量量化的混合数据库的FCM类型模糊聚类
2. A novel variable selection approach based on co-linearity index to discover optimal process settings by analysing mixed data [J] . C. Giannetti, R.S. Ransing, M.R. Ransing, Computers & Industrial Engineering . 2014,第juna期

机译：一种基于共线性指标的变量选择方法，通过分析混合数据来发现最佳过程设置
3. Cooperative Coevolution for Large-Scale Optimization Based on Kernel Fuzzy Clustering and Variable Trust Region Methods [J] . Jianchao Fan, Jun Wang, Min Han Fuzzy Systems, IEEE Transactions on . 2014,第4期

机译：基于核模糊聚类和可变信赖域方法的大规模优化协同进化
4. Linear Fuzzy Clustering of Mixed Databases Based on Cluster-wise Optimal Scaling of Categorical Variables [C] . Honda Katsuhiro, Uesugi Ryo, Ichihashi Hidetomo, IEEE International Conference on Fuzzy Systems . 2007

机译：基于聚类的混合数据库的线性模糊聚类基于集群 - 明智的分类变量的最佳缩放
5. Inventory Management with Order Errors, Optimal Investment to Reduce Emissions, and Generalized Cluster-wise Linear Regression. [D] . Jiang, Yan. 2012

机译：具有订单错误的库存管理，减少排放的最佳投资以及广义的聚类线性回归。
6. Exploring dependence between categorical variables: Benefits and limitations of using variable selection within Bayesian clustering in relation to log-linear modelling with interaction terms [O] . Michail Papathomas, Sylvia Richardson -1

机译：探索类别变量之间的依存关系：在贝叶斯聚类中使用变量选择相对于具有交互项的对数线性建模的好处和局限性
7. Local Principal Component Analysis for Mixed Databases Based on Linear Fuzzy Clustering [O] . Ryo UESUGI, Katsuhiro HONDA, Hidetomo ICHIHASHI, 2007

机译：基于线性模糊聚类的混合数据库本地主成分分析

Linear Fuzzy Clustering of Mixed Databases Based on Cluster-wise Optimal Scaling of Categorical Variables

摘要

著录项

相似文献

相关主题

期刊订阅