Multiobjective Genetic Algorithm-Based Fuzzy Clustering of Categorical Attributes

Mukhopadhyay A.; Maulik U.; Bandyopadhyay S.

首页> 外文期刊>Evolutionary Computation, IEEE Transactions on >Multiobjective Genetic Algorithm-Based Fuzzy Clustering of Categorical Attributes

【24h】

Multiobjective Genetic Algorithm-Based Fuzzy Clustering of Categorical Attributes

机译：基于多目标遗传算法的分类属性模糊聚类

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Recently, the problem of clustering categorical data, where no natural ordering among the elements of a categorical attribute domain can be found, has been gaining significant attention from researchers. With the growing demand for categorical data clustering, a few clustering algorithms with focus on categorical data have recently been developed. However, most of these methods attempt to optimize a single measure of the clustering goodness. Often, such a single measure may not be appropriate for different kinds of datasets. Thus, consideration of multiple, often conflicting, objectives appears to be natural for this problem. Although we have previously addressed the problem of multiobjective fuzzy clustering for continuous data, these algorithms cannot be applied for categorical data where the cluster means are not defined. Motivated by this, in this paper a multiobjective genetic algorithm-based approach for fuzzy clustering of categorical data is proposed that encodes the cluster modes and simultaneously optimizes fuzzy compactness and fuzzy separation of the clusters. Moreover, a novel method for obtaining the final clustering solution from the set of resultant Pareto-optimal solutions in proposed. This is based on majority voting among Pareto front solutions followed by $k$-nn classification. The performance of the proposed fuzzy categorical data-clustering techniques has been compared with that of some other widely used algorithms, both quantitatively and qualitatively. For this purpose, various synthetic and real-life categorical datasets have been considered. Also, a statistical significance test has been conducted to establish the significant superiority of the proposed multiobjective approach.

机译：最近，聚类分类数据的问题已引起研究人员的极大关注，在分类数据中无法找到分类属性域的元素之间的自然顺序。随着对分类数据聚类的需求不断增长，最近已经开发了一些针对分类数据的聚类算法。但是，大多数这些方法都试图优化聚类优度的单个度量。通常，这种单一度量可能不适用于不同种类的数据集。因此，对于这个问题，考虑多个目标（通常是相互冲突的）似乎是很自然的。尽管我们先前已经解决了连续数据的多目标模糊聚类的问题，但是这些算法无法应用于未定义聚类平均值的分类数据。为此，本文提出了一种基于多目标遗传算法的分类数据模糊聚类方法，该方法对聚类模式进行编码，同时优化了聚类的模糊紧度和模糊分离。此外，提出了一种新的方法，该方法用于从一组所得的帕累托最优解中获取最终的聚类解。这基于Pareto前沿解决方案中的多数投票，然后进行$ k $ -nn分类。所提出的模糊分类数据聚类技术的性能已与其他一些广泛使用的算法进行了定量和定性的比较。为了这个目的，已经考虑了各种合成的和真实的分类数据集。此外，已经进行了统计显着性检验，以建立提出的多目标方法的显着优势。

著录项

来源
《Evolutionary Computation, IEEE Transactions on》 |2009年第5期|p.991-1005|共15页
作者
Mukhopadhyay A.; Maulik U.; Bandyopadhyay S.;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Categorical attributes; fuzzy clustering; multiobjective genetic algorithm; pareto optimality;

机译：分类属性;模糊聚类;多目标遗传算法;最优;

相似文献

外文文献
中文文献
专利

1. EGA-FMC: enhanced genetic algorithm-based fuzzy k-modes clustering for categorical data [J] . Medhini Narasimhan, Balaji Balasubramanian, Suryansh D. Kumar, International Journal of Bio-Inspired Computation . 2018,第4期

机译：EGA-FMC：基于增强的基于遗传算法的模糊k模式，用于分类数据
2. GACC: genetic algorithm-based categorical data clustering for large datasets [J] . Abha Sharma, R.S. Thakur International journal of data mining, modelling and management . 2017,第4期

机译：GACC：用于大型数据集的基于遗传算法的分类数据聚类
3. Enhanced genetic algorithm-based fuzzy multiobjective strategy to multiproduct batch plant design [J] . A. A. Aguilar-Lasserre, L. Pibouleau, C. Azzaro-Pantel, Applied Soft Computing . 2009,第4期

机译：基于增强遗传算法的模糊多目标多批次工厂设计策略
4. Multiobjective Genetic Fuzzy Clustering of Categorical Attributes [C] . Mukhopadhyay Anirban, Maulik Ujjwal, Bandyopadhyay Sanghamitra, International Conference on Information Technology . 2007

机译：分类属性的多目标基因模糊聚类
5. Genetic algorithm-based optimization in the development of tropospheric ozone control strategies: Least cost, multiobjective, alternative generation, and chance-constrained applications. [D] . Loughlin, Daniel Hopkins. 1998

机译：对流层臭氧控制策略开发中基于遗传算法的优化：成本最低，多目标，替代发电和机会受限的应用。
6. Fuzzy C-Means Clustering Algorithm-Based Magnetic Resonance Imaging Image Segmentation for Analyzing the Effect of Edaravone on the Vascular Endothelial Function in Patients with Acute Cerebral Infarction [O] . Jie Yin, Hong Chang, Dongmei Wang, 2021

机译：基于模糊的C型聚类算法磁共振成像图像分析用于分析埃达拉夫酮对急性脑梗死患者血管内皮功能的影响
7. Enhanced genetic algorithm-based fuzzy multiobjective strategy to multiproduct batch plant design [O] . Aguilar-Lasserre Alberto, Pibouleau Luc, Azzaro-Pantel Catherine, 2009

机译：基于增强遗传算法的模糊多目标多批次工厂设计策略

Multiobjective Genetic Algorithm-Based Fuzzy Clustering of Categorical Attributes

摘要

著录项

相似文献

相关主题

期刊订阅