K-means properties on six clustering benchmark datasets

Franti Pasi; Sieranoja Sami

首页> 外文期刊>Applied Intelligence: The International Journal of Artificial Intelligence, Neural Networks, and Complex Problem-Solving Technologies >K-means properties on six clustering benchmark datasets

【24h】

K-means properties on six clustering benchmark datasets

机译：K-均值六个聚类基准数据集上的属性

获取原文

获取原文并翻译 | 示例

获取外文期刊封面目录资料

开具论文收录证明 >>

文献代查 >>

文献数据库（团队版） >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

This paper has two contributions. First, we introduce a clustering basic benchmark. Second, we study the performance of k-means using this benchmark. Specifically, we measure how the performance depends on four factors: (1) overlap of clusters, (2) number of clusters, (3) dimensionality, and (4) unbalance of cluster sizes. The results show that overlap is critical, and that k-means starts to work effectively when the overlap reaches 4% level.

机译：本文有两项贡献。首先，我们介绍群集基本基准。其次，我们研究了使用此基准测试的K-Meance的性能。具体而言，我们测量性能如何取决于四个因素：（1）簇重叠，（2）簇数，（3）维度和（4）簇大小的不平衡。结果表明，重叠是至关重要的，并且当重叠达到4％的级别时，k均值开始有效地工作。

著录项

来源
《Applied Intelligence: The International Journal of Artificial Intelligence, Neural Networks, and Complex Problem-Solving Technologies》 |2018年第12期|共17页
作者
Franti Pasi; Sieranoja Sami;
展开▼
作者单位

Univ Eastern Finland Sch Comp Machine Learning Grp POB 111 FIN-80101 Joensuu Finland;

Univ Eastern Finland Sch Comp Machine Learning Grp POB 111 FIN-80101 Joensuu Finland;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类自动化技术、计算机技术;
关键词
Clustering algorithms; Clustering quality; k-means; Benchmark;

机译：聚类算法;聚类质量;K-means;基准;

相似文献

外文文献
中文文献
专利

1. K-means properties on six clustering benchmark datasets [J] . Franti Pasi, Sieranoja Sami Applied Intelligence: The International Journal of Artificial Intelligence, Neural Networks, and Complex Problem-Solving Technologies . 2018,第12期

机译：K-均值六个聚类基准数据集上的属性
2. A k-means type clustering algorithm for subspace clustering of mixed numeric and categorical datasets [J] . Amir Ahmad, Lipika Dey Pattern recognition letters . 2011,第7期

机译：一种k均值类型聚类算法，用于混合数值和分类数据集的子空间聚类
3. Fuzzy K-means clustering with fast density peak clustering on multivariate kernel estimator with evolutionary multimodal optimization clusters on a large dataset [J] . G. Surya Narayana, Kamakshaiah Kolli Multimedia Tools and Applications . 2021,第3期

机译：具有大型数据集的传播多式化优化集群的多变核估计快速密度峰值聚类的模糊k均值聚类
4. An Improved Clustering Algorithm Based on k-Means and Artificial Bee Colony Optimization for Datasets that Contain Outliers [C] . Anu Balachandran, K.A. Abdul Nazeer 2018 International Conference on Computing, Power and Communication Technologies . 2018

机译：包含离群值的数据集的基于k均值和人工蜂群优化的改进聚类算法
5. Visual data mining: Using parallel coordinate plots with K-means clustering and color to find correlations in a multidimensional dataset. [D] . Peterson, Angela R. 2009

机译：可视数据挖掘：使用具有K均值聚类和颜色的平行坐标图来查找多维数据集中的相关性。
6. Canonical PSO Based K-Means Clustering Approach for Real Datasets [O] . Lopamudra Dey, Sanjay Chakraborty 2014

机译：基于规范PSO的真实数据集K-Means聚类方法
7. Analysis of Simple K-Mean and Parallel K-Mean Clustering for Software Products and Organizational Performance Using Education Sector Dataset [O] . Rui Shang, Balqees Ara, Islam Zada, 2021

机译：使用教育部门数据集分析软件产品和组织绩效的简单K均值和平行k平均聚类
8. Sampling Within k-Means Algorithm to Cluster Large Datasets [R] . Bejarano, J., Bose, K., Brannan, T., 2011

机译：在k-means算法中采样以聚类大数据集

K-means properties on six clustering benchmark datasets

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅