Feature reduction fuzzy C-Means algorithm leveraging the marginal kurtosis measure

Pan Xingguang; Wang Shitong

首页> 外文期刊>Journal of intelligent & fuzzy systems: Applications in Engineering and Technology >Feature reduction fuzzy C-Means algorithm leveraging the marginal kurtosis measure

【24h】

Feature reduction fuzzy C-Means algorithm leveraging the marginal kurtosis measure

机译：特征减少模糊C型算法利用边缘峰度措施

获取原文

获取原文并翻译 | 示例

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

团队文献服务 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

The feature reduction fuzzy c-means (FRFCM) algorithm has been proven to be effective for clustering data with redundant/unimportant feature(s). However, the FRFCM algorithm still has the following disadvantages. 1) The FRFCM uses the mean-to-variance-ratio (MVR) index to measure the feature importance of a dataset, but this index is affected by data normalization, i.e., a large MVR value of original feature(s) may become small if the data are normalized, and vice versa. Moreover, the MVR value(s) of the important feature(s) of a dataset may not necessarily be large. 2) The feature weights obtained by the FRFCM are sensitive to the initial cluster centers and initial feature weights. 3) The FRFCM algorithm may be unable to assign the proper weights to the features of a dataset. Thus, in the feature reduction learning process, important features may be discarded, but unimportant features may be retained. These disadvantages can cause the FRFCM algorithm to discard important feature components. In addition, the threshold for the selection of the important feature(s) of the FRFCM may not be easy to determine. To mitigate the disadvantages of the FRFCM algorithm, we first devise a new index, named the marginal kurtosis measure (MKM), to measure the importance of each feature in a dataset. Then, a novel and robust feature reduction fuzzy c-means clustering algorithm called the FRFCM-MKM, which incorporates the marginal kurtosis measure into the FRFCM, is proposed. Furthermore, an accurate threshold is introduced to select important feature(s) and discard unimportant feature(s). Experiments on synthetic and real-world datasets demonstrate that the FRFCM-MKM is effective and efficient.

机译：特征约简模糊c均值（FRFCM）算法已被证明对具有冗余/不重要特征的数据聚类是有效的。然而，FRFCM算法仍然存在以下缺点。1） FRFCM使用均值-方差比（MVR）指数来衡量数据集的特征重要性，但该指数受数据标准化的影响，即，如果数据标准化，原始特征的大MVR值可能变小，反之亦然。此外，数据集重要特征的MVR值不一定很大。2） FRFCM得到的特征权重对初始聚类中心和初始特征权重敏感。3） FRFCM算法可能无法为数据集的特征分配适当的权重。因此，在特征约简学习过程中，重要特征可能会被丢弃，但不重要的特征可能会被保留。这些缺点会导致FRFCM算法丢弃重要的特征组件。此外，选择FRFCM重要特征的阈值可能不容易确定。为了缓解FRFCM算法的缺点，我们首先设计了一个新的索引，称为边际峰度度量（MKM），用于度量数据集中每个特征的重要性。然后，提出了一种新的、鲁棒的特征约简模糊c均值聚类算法FRFCM-MKM，该算法将边缘峭度测度引入到FRFCM中。此外，还引入了一个精确的阈值来选择重要的特征并丢弃不重要的特征。在合成数据集和真实数据集上的实验表明，FRFCM-MKM是有效的。

著录项

来源
《Journal of intelligent & fuzzy systems: Applications in Engineering and Technology 》 |2020年第2期| 共21页
作者
Pan Xingguang; Wang Shitong;
展开▼
作者单位

Jiangnan Univ Sch Digital Media Wuxi 214122 Jiangsu Peoples R China;

Jiangnan Univ Sch Digital Media Wuxi 214122 Jiangsu Peoples R China;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类自动化系统 ;
关键词
Fuzzy c-means; feature reduction learning; marginal kurtosis measure; mean-to-variance ratio;

机译：模糊c-均值;特征约简学习;边缘峰度测度;均值方差比;

相似文献

外文文献
中文文献
专利

1. Evaluation of Organizational Learning Ability Based on Fuzzy C-means and Unascertained Measure [J] . Runliang, Wang, Zhiqiang, 系统科学与信息学报：英文版 . 2006 ,第004期
2. Background dominant colors extraction method based on color image quick fuzzy c-means clustering algorithm [J] . Zun-yang Liu, Feng Ding, Ying Xu, 兵工学报（英文版） . 2021 ,第005期
3. Partition region-based suppressed fuzzy C-means algorithm [J] . Kun Zhang, Weiren Kong, Peipei Liu, 系统工程与电子技术（英文版） . 2017 ,第005期
4. Development of slope mass rating system using K-means and fuzzy c-means clustering algorithms [J] . Jalali Zakaria 矿业科学技术（英文版） . 2016 ,第006期
5. The fuzzy C-means algorithm with fuzzy P-mode prototypes for clustering objects having mixed features [J] . Mahnhoon Lee, Witold Pedrycz Fuzzy sets and systems . 2009 ,第24期

机译：具有模糊P型原型的模糊C均值算法用于聚类具有混合特征的对象
6. Feature clustering and feature discretization assisting gene selection for molecular classification using fuzzy c-means and expectation-maximization algorithm [J] . Lin Hung-Yi Journal of supercomputing . 2021 ,第6期

机译：特征聚类和特征离散化辅助使用模糊C型方式和期望最大化算法进行分子分类的基因选择
7. Feature selection strategy based on hybrid crow search optimization algorithm integrated with chaos theory and fuzzy c-means algorithm for medical diagnosis problems [J] . Soft computing: A fusion of foundations, methodologies and applications . 2020 ,第3期

机译：基于混沌理论的混合乌布搜索优化算法的特征选择策略和模糊C型算法医学诊断问题
8. Feature reduction using fuzzy C-means clustering and Firefly algorithm [C] . Ako Ahmadi, Keyhan Khamforoosh International Conference on Computer and Knowledge Engineering . 2020

机译：使用模糊C-means聚类和萤火虫算法减少特征
9. Integrating information theory measures and a novel rule-set-reduction technique to improve fuzzy decision tree induction algorithms. [D] . Abu-halaweh, Na'el. 2010

机译：集成信息论的措施和一种新颖的规则集约简技术，以改进模糊决策树的归纳算法。
10. A New Validity Measure for a Correlation-Based Fuzzy C-means Clustering Algorithm [O] . Mingrui Zhang, Wei Zhang, Hugues Sicotte, -1

机译：基于相关性的模糊C-均值聚类算法的有效性检验
11. Comparison of Various Improved-Partition Fuzzy c-Means Clustering Algorithms in Fast Color Reduction [O] . 2016

机译：不同改进分区模糊c-均值聚类算法在快速减色中的比较
12. Fuzzy Robust Statistics for Application to the Fuzzy c-Means Clustering Algorithm [R] . Kersten, P. R. 1993

机译：模糊稳健统计量在模糊c-均值聚类算法中的应用

Feature reduction fuzzy C-Means algorithm leveraging the marginal kurtosis measure

摘要

著录项

相似文献

相关主题

期刊订阅