首页> 外文会议>Australian Joint Conference on Artificial Intelligence; 20041204-06; Cairns(AU) >Clustering Large Datasets Using Cobweb and K-Means in Tandem

【24h】

Clustering Large Datasets Using Cobweb and K-Means in Tandem

机译：串联使用Cobweb和K-Means聚类大型数据集

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper presents a single scan algorithm for clustering large datasets based on a two phase process which combines two well known clustering methods. The Cobweb algorithm is modified to produce a balanced tree with subclusters at the leaves, and then K-means is applied to the resulting subclusters. The resulting method, Scalable Cobweb, is then compared to a single pass K-means algorithm and standard K-means. The evaluation looks at error as measured by the sum of squared error and vulnerability to the order in which data points are processed.

机译：本文提出了一种基于两阶段过程的大型集群数据集的单一扫描算法，该过程结合了两种众所周知的聚类方法。修改了Cobweb算法，以生成叶子处带有子簇的平衡树，然后将K-means应用于生成的子簇。然后将所得方法可伸缩蜘蛛网与单遍K均值算法和标准K均值进行比较。评估着眼于误差，该误差由平方误差和对数据点处理顺序的脆弱性的总和来衡量。

著录项

来源
《Australian Joint Conference on Artificial Intelligence; 20041204-06; Cairns(AU) 》|2004年|P.368-379|共12页
会议地点 Cairns(AU)
作者
Mi Li; Geoffrey Holmes; Bernhard Pfahringer;
展开▼
作者单位

Department of Computer Science, University of Waikato, Hamilton, New Zealand;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类人工智能理论 ;
关键词

相似文献

外文文献
中文文献
专利

1. A hybrid reciprocal model of PCA and K-means with an innovative approach of considering sub-datasets for the improvement of K-means initialization and step-by-step labeling to create clusters with high interpretability [J] . Anaraki Seyed Alireza Mousavian, Haeri Abdorrahman, Moslehi Fateme Pattern Analysis and Applications . 2021 ,第3期

机译：具有创新方法的PCA和K-in的混合互惠模型，其考虑子数据集改进K-Means初始化和逐步标记，以创建具有高可解释性的群集
2. A k-means type clustering algorithm for subspace clustering of mixed numeric and categorical datasets [J] . Amir Ahmad, Lipika Dey Pattern recognition letters . 2011 ,第7期

机译：一种k均值类型聚类算法，用于混合数值和分类数据集的子空间聚类
3. Fuzzy K-means clustering with fast density peak clustering on multivariate kernel estimator with evolutionary multimodal optimization clusters on a large dataset [J] . G. Surya Narayana, Kamakshaiah Kolli Multimedia Tools and Applications . 2021 ,第3期

机译：具有大型数据集的传播多式化优化集群的多变核估计快速密度峰值聚类的模糊k均值聚类
4. Clustering Large Datasets Using Cobweb and K-Means in Tandem [C] . Mi Li, Geoffrey Holmes, Bernhard Pfahringer Australian Joint Conference on Artificial Intelligence . 2004

机译：使用COBWEB和K-Meanse在串联中聚类大型数据集
5. Visual data mining: Using parallel coordinate plots with K-means clustering and color to find correlations in a multidimensional dataset. [D] . Peterson, Angela R. 2009

机译：可视数据挖掘：使用具有K均值聚类和颜色的平行坐标图来查找多维数据集中的相关性。
6. Canonical PSO Based K-Means Clustering Approach for Real Datasets [O] . Lopamudra Dey, Sanjay Chakraborty 2014

机译：基于规范PSO的真实数据集K-Means聚类方法
7. Clustering large datasets using cobweb and K-means in tandem [O] . Li Mi, Holmes Geoffrey, Pfahringer Bernhard 2005

机译：串联使用蛛网和K-means聚类大型数据集
8. Sampling Within k-Means Algorithm to Cluster Large Datasets [R] . Bejarano, J., Bose, K., Brannan, T., 2011

机译：在k-means算法中采样以聚类大数据集

Clustering Large Datasets Using Cobweb and K-Means in Tandem

摘要

著录项

相似文献

相关主题

期刊订阅