A weighted multivariate Fuzzy C-Means method in interval-valued scientific production data

Bruno Almeida Pimentel; Renata M.C.R. de Souza

首页> 外文期刊>Expert Systems with Application >A weighted multivariate Fuzzy C-Means method in interval-valued scientific production data

【24h】

A weighted multivariate Fuzzy C-Means method in interval-valued scientific production data

机译：区间值科学生产数据中的加权多元模糊C-均值方法

获取原文

获取原文并翻译 | 示例

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

团队文献服务 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Clustering is the process of organizing objects into groups whose members are similar in some way. Most of the clustering methods involve numeric data only. However, this representation may not be adequate to model complex information which may be: histogram, distributions, intervals. To deal with these types of data, Symbolic Data Analysis (SDA) was developed. In multivariate data analysis, it is common some variables be more or less relevant than others and less relevant variables can mask the cluster structure. This work proposes a clustering method based on fuzzy approach that produces weighted multivariate memberships for interval-valued data. These memberships can change at each iteration of the algorithm and they are different from one variable to another and from one cluster to another. Furthermore, there is a different relevance weight associated to each variable that may also be different from one cluster to another. The advantage of this method is that it is robust to ambiguous cluster membership assignment since weights represent how important the different variables are to the clusters. Experiments are performed with synthetic data sets to compare the performance of the proposed method against other methods already established by the clustering literature. Also, an application with interval-valued scientific production data is presented in this work. Clustering quality results have shown that the proposed method offers higher accuracy when variables have different variabilities.

机译：群集是将对象组织成成员在某种程度上相似的组的过程。大多数聚类方法仅涉及数字数据。但是，此表示可能不足以对复杂的信息进行建模，这些信息可能是：直方图，分布，区间。为了处理这些类型的数据，开发了符号数据分析（SDA）。在多变量数据分析中，常见的是某些变量比其他变量或多或少具有相关性，而较少相关的变量可以掩盖聚类结构。这项工作提出了一种基于模糊方法的聚类方法，该方法为区间值数据生成加权多元成员资格。这些成员资格可以在算法的每次迭代中更改，并且它们从一个变量到另一个变量以及从一个群集到另一个群集都是不同的。此外，存在与每个变量相关联的不同的相关性权重，该相关性权重也可能在一个群集与另一个群集之间不同。该方法的优点是，它对歧义的群集成员资格分配具有鲁棒性，因为权重表示不同变量对群集的重要性。用合成数据集进行实验，以将所提出的方法的性能与聚类文献已经建立的其他方法进行比较。此外，这项工作还介绍了具有区间值的科学生产数据的应用程序。聚类质量结果表明，当变量具有不同的可变性时，该方法具有较高的准确性。

著录项

来源
《Expert Systems with Application 》 |2014年第7期| 3223-3236| 共14页
作者
Bruno Almeida Pimentel; Renata M.C.R. de Souza;
展开▼
作者单位

Universidade Federal de Pernambuco (UFPE), Centra de Informatica (CIn), Av. Jomalista Anibal Fernandes, s - Cidade Universitaria, 50.740-560 Recife. PE, Brazil;

Universidade Federal de Pernambuco (UFPE), Centra de Informatica (CIn), Av. Jomalista Anibal Fernandes, s - Cidade Universitaria, 50.740-560 Recife. PE, Brazil;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Clustering; Fuzzy C-Means method; Symbolic Data Analysis; Weighted multivariate membership; Scientific production data;

机译：集群;模糊C均值法符号数据分析;加权多元隶属度;科学生产数据;

相似文献

外文文献
中文文献
专利

1. Application of semi-supervised fuzzy c-means method in clustering multivariate geochemical data, a case study from the Dalli Cu-Au porphyry deposit in central Iran [J] . Fatehi Moslem, Asadi Hooshang H. Ore Geology Reviews: Journal for Comprehensive Studies of Ore Genesis and Ore Exploration . 2017 ,第Pta1期

机译：半监督模糊C-型方法在聚类多元化地球化学数据中的应用，伊朗中部达利Cu-Au斑岩矿床案例研究
2. Towards hybrid clustering approach to data classification: Multiple kernels based interval-valued Fuzzy C-Means algorithms [J] . Dzung Dinh Nguyen, Long Thanh Ngo, Long The Pham, Fuzzy sets and systems . 2015 ,第NOVa15期

机译：迈向数据分类的混合聚类方法：基于多个核的区间值模糊C均值算法
3. A New Decision Making Method Using Interval-Valued Intuitionistic Fuzzy Cosine Similarity Measure Based on the Weighted Reduced Intuitionistic Fuzzy Sets [J] . Rajkumar VERMA, Jose M. MERIGO Informatica . 2020 ,第2期

机译：一种新的决策方法，采用基于加权减少直觉模糊集的间隔值直觉模糊余弦相似度量
4. Partitioning Fuzzy C-Means Clustering Algorithms for Interval-Valued Data Based on City-Block Distances [C] . de A T de Carvalho F., Barbosa G.B.N., Pimentel J.T. Brazilian Conference on Intelligent Systems . 2013

机译：基于城市间距离的区间值数据模糊C均值聚类算法
5. Development of Texture Weighted Fuzzy C-Means Algorithm for 3D Brain MRI Segmentation [D] . Lee, Ji Young. 2018

机译：用于3D脑MRI分割的纹理加权模糊C均值算法的开发
6. An Integrated Fuzzy C-Means Method for Missing Data Imputation Using Taxi GPS Data [O] . Junsheng Huang, Baohua Mao, Yun Bai, 2020

机译：出租车GPS数据的数据归因综合模糊C均值方法
7. A New Decision Making Method Using Interval-Valued Intuitionistic Fuzzy Cosine Similarity Measure Based on the Weighted Reduced Intuitionistic Fuzzy Sets [O] . Rajkumar Verma, José M. Merigó 2020

机译：一种新的决策方法，采用基于加权减少直觉模糊集的间隔值直觉模糊余弦相似度量
8. DEVELOPMENT AND TESTING OF METHODS BASED ON MULTIVARIATE STATISTICS AND SIGNAL PROCESSING FOR COMPACTION OF LARGE SETS OF SCIENTIFIC MEASUREMENT DATA [R] . Robert K. Otnes, Loren Enochson, N. Roy Goodman 1974

机译：基于多元统计和信号处理的大型科学测量数据压缩方法的开发和测试

A weighted multivariate Fuzzy C-Means method in interval-valued scientific production data

摘要

著录项

相似文献

相关主题

期刊订阅