Two density-based k-means initialization algorithms for non-metric data clustering

Bianchi Filippo Maria; Livi Lorenzo; Rizzi Antonello

首页> 外文期刊>Pattern Analysis and Applications >Two density-based k-means initialization algorithms for non-metric data clustering

【24h】

Two density-based k-means initialization algorithms for non-metric data clustering

机译：非度量数据聚类的两种基于密度的k均值初始化算法

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper, we propose a density-based clusters' representatives selection algorithm that identifies the most central patterns from the dense regions in the dataset. The method, which has been implemented using two different strategies, is applicable to input spaces with no trivial geometry. Our approach exploits a probability density function built through the Parzen estimator, which relies on a (not necessarily metric) dissimilarity measure. Being a representatives extractor a general-purpose algorithm, our method is obviously applicable in different contexts. However, to test the proposed procedure, we specifically consider the problem of initializing the k-means algorithm. We face problems defined on standard real-valued vectors, labeled graphs, and finally sequences of real-valued vectors and sequences of characters. The obtained results demonstrate the effectiveness of the proposed representative selection method with respect to other state-of-the-art solutions.

机译：在本文中，我们提出了一种基于密度的聚类代表选择算法，该算法从数据集中的密集区域中识别出最中心的模式。该方法已使用两种不同的策略实施，适用于没有平凡几何形状的输入空间。我们的方法利用了通过Parzen估计器建立的概率密度函数，该函数依赖于（不一定是度量）不相似性度量。作为一种通用算法的代表提取器，我们的方法显然适用于不同的环境。但是，为了测试建议的过程，我们特别考虑了初始化k-means算法的问题。我们面临的问题是在标准实值向量，带标签的图以及最终的实值向量序列和字符序列上定义的问题。获得的结果证明了所提出的代表性选择方法相对于其他最新解决方案的有效性。

著录项

来源
《Pattern Analysis and Applications》 |2016年第3期|745-763|共19页
作者
Bianchi Filippo Maria; Livi Lorenzo; Rizzi Antonello;
展开▼
作者单位

SAPIENZA Univ Rome, Dept Informat Engn Elect & Telecommun, Via Eudossiana 18, I-00184 Rome, Italy;

SAPIENZA Univ Rome, Dept Informat Engn Elect & Telecommun, Via Eudossiana 18, I-00184 Rome, Italy;

SAPIENZA Univ Rome, Dept Informat Engn Elect & Telecommun, Via Eudossiana 18, I-00184 Rome, Italy;

展开▼
收录信息美国《科学引文索引》(SCI);
原文格式 PDF
正文语种 eng
中图分类
关键词
Clustering; Prototype selection; k-means initialization; Dissimilarity measures; Non-metric domains;

机译：聚类;原型选择;k-均值初始化;相异性度量;非度量域;

相似文献

外文文献
中文文献
专利

1. An initial seed selection algorithm for k-means clustering of georeferenced data to improve replicability of cluster assignments for mapping application [J] . Fouad Khan Applied Soft Computing . 2012,第11期

机译：用于地理参考数据的k均值聚类的初始种子选择算法，以提高用于地图绘制应用的聚类分配的可复制性
2. Does Determination of Initial Cluster Centroids Improve the Performance of K-Means Clustering Algorithm? Comparison of Three Hybrid Methods by Genetic Algorithm, Minimum Spanning Tree, and Hierarchical Clustering in an Applied Study [J] . Saeedeh Pourahmad, Atefeh Basirat, Amir Rahimi, Computational and mathematical methods in medicine . 2020,第1期

机译：初始簇质心的确定是否提高了K-Means聚类算法的性能？应用研究中遗传算法，最小生成树和分层聚类的三种混合方法的比较
3. Novel density-based and hierarchical density-based clustering algorithms for uncertain data [J] . Zhang Xianchao, Liu Han, Zhang Xiaotong Neural Networks: The Official Journal of the International Neural Network Society . 2017,第期

机译：基于新的基于密度和分层密度的基于分层密度的不确定数据集群算法
4. A Density-Based Method for Selection of the Initial Clustering Centers of K-means Algorithm [C] . Xin Du, Ning Xu, Cailan Zhou, IEEE Advanced Information Technology, Electronic and Automation Control Conference . 2017

机译：基于密度的选择方法，用于选择K-Means算法的初始聚类中心
5. Clustering educational digital library usage data: Comparisons of latent class analysis and K-means algorithms [D] . Xu, Beijie 2011

机译：聚集教育数字图书馆使用数据：潜在类别分析和K-means算法的比较
6. Does Determination of Initial Cluster Centroids Improve the Performance of K-Means Clustering Algorithm? Comparison of Three Hybrid Methods by Genetic Algorithm Minimum Spanning Tree and Hierarchical Clustering in an Applied Study [O] . Saeedeh Pourahmad, Atefeh Basirat, Amir Rahimi, 2020

机译：初始簇质心的确定是否提高了K-Means聚类算法的性能？应用研究中遗传算法最小生成树和分层聚类的三种混合方法的比较
7. An Initial Seed Selection Algorithm for K-means Clustering of Georeferenced Data to Improve Replicability of Cluster Assignments for Mapping Application [O] . Khan, Fouad 2016

机译：一种用于K-means聚类的初始种子选择算法地理参考数据提高集群分配的可复制性制图应用

Two density-based k-means initialization algorithms for non-metric data clustering

摘要

著录项

相似文献

相关主题

期刊订阅