Fuzzy granular principal curves algorithm for large data sets

机译：大数据集的模糊粒状主曲线算法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Principal curves, as a nonlinear generalization of principal components, are a common tool used in multivariate analysis for ends like dimensionality reduction and feature extraction. However, one of the difficulties that arise when utilizing this technique is that efficiency of existing principal curves algorithms is often low when dealing with large data set owing to high computational complexity. In the paper, a new method based on the idea of "information granulation and fuzzy sets" is proposed to improve efficiency and noise robustness. First, large amounts of numerical data are granulated into C interval (granular) data based on the fuzzy C-means cluster and two criteria of granulation, which significantly reduces the amount of data that is to be processed in the later step. Then granular principal curves are constructed according to the upper and the lower bounds of the interval data. Finally we introduce a quantitative index based on the parameter a to evaluate the fuzziness of granular principal curves output, where a is a positive parameter delivering some flexibility when optimizing the information granule. A series of numeric studies completed for synthetic data set provide a useful insight into the effectiveness of the proposed algorithm.

机译：作为主要组分的非线性概括的主要曲线是用于多变量分析的常用工具，其端部是二维性降低和特征提取。然而，在利用该技术时出现的困难之一是当由于高计算复杂度处理大数据集时，现有主曲线算法的效率通常很低。本文提出了一种基于“信息造粒和模糊集”思想的新方法，以提高效率和噪音鲁棒性。首先，基于模糊C-Means集群和两个肉芽标准将大量数值数据造成C间隔（粒度）数据，这显着降低了在后面的步骤中要处理的数据量。然后根据间隔数据的上限和下限构造粒状主曲线。最后，我们基于参数A引入定量指数，以评估粒状主曲线输出的模糊性，其中A是在优化信息颗粒时提供一些灵活性的正参数。为合成数据集完成的一系列数字研究提供了对所提出的算法的有效性的有用洞察力。

著录项

来源
《Joint IFSA World Congress and NAFIPS Annual Meeting》|2013年||共6页
会议地点
作者
Hongyun Zhang; Duoqian Miao; Witold Pedrycz;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP273.4-53;
关键词
granular principal curve; interval data; fuzzy C-mean cluster; fuzziness quantization;

机译：粒状主曲线;间隔数据;模糊C均值;模糊量化;

相似文献

外文文献
中文文献
专利

1. From Principal Curves to Granular Principal Curves [J] . Zhang H., Pedrycz W., Miao D., Cybernetics, IEEE Transactions on . 2014,第6期

机译：从主曲线到粒状主曲线
2. Multi-granularity principal curves extraction based on improved spectral clustering of complex distribution data [J] . Zhang Hongyun, Zhang Ting, Wang Peipei, 高分子論文集 . 2019,第FEBa期

机译：基于复杂分布数据光谱聚类的多粒度主曲线提取
3. A four-way decision-making approach using interval-valued fuzzy sets, rough set and granular computing: a new approach in data classification and decision-making [J] . Pritpal Singh, Yo-Ping Huang Granular Computing . 2020,第3期

机译：一种使用间隔值模糊集，粗糙集和粒度计算的四通决策方法：数据分类和决策中的新方法
4. Fuzzy granular principal curves algorithm for large data sets [C] . Zhang Hongyun, Miao Duoqian, Pedrycz Witold 2013 Joint IFSA World Congress and NAFIPS Annual Meeting . 2013

机译：大数据集的模糊粒状主曲线算法
5. Modifications of the Fuzzy -ARTMAP algorithm for distributed learning in large data sets [D] . Castro, Jose 2004

机译：大数据集中的分布式学习的Fuzzy -ARTMAP算法的修改
6. Automated Detection of Cancer Associated Genes Using a Combined Fuzzy-Rough-Set-Based F-Information and Water Swirl Algorithm of Human Gene Expression Data [O] . Pugalendhi Ganesh Kumar, Muthu Subash Kavitha, Byeong-Cheol Ahn 2011

机译：使用基于模糊粗糙集的F信息和人类基因表达数据的水旋流算法相结合的癌症相关基因自动检测
7. A Genetic Algorithm for Feature Selection and Granularity Learning in Fuzzy Rule-Based Classification Systems for Highly Imbalanced Data-Sets [O] . Pedro Villar, Francisco Herrera 2014

机译：基于模糊规则的高度不平衡数据集分类系统的特征选择和粒度学习遗传算法
8. Seventh International Workshop on Rough Sets, Fuzzy Sets, Data Mining, and Granular-Soft Computing (RSFDGrC'99) [R] . Zhong, N. , Skowron, A. , Ohsuga, S. 1999

机译：第七届粗糙集，模糊集，数据挖掘和粒度软计算国际研讨会（RsFDGrC'99）

Fuzzy granular principal curves algorithm for large data sets

摘要

著录项

相似文献

相关主题

期刊订阅