首页> 外文会议>Australian Joint Conference on Artificial Intelligence; 20041204-06; Cairns(AU) >Clustering Large Datasets Using Cobweb and K-Means in Tandem
【24h】

Clustering Large Datasets Using Cobweb and K-Means in Tandem

机译:串联使用Cobweb和K-Means聚类大型数据集

获取原文
获取原文并翻译 | 示例

摘要

This paper presents a single scan algorithm for clustering large datasets based on a two phase process which combines two well known clustering methods. The Cobweb algorithm is modified to produce a balanced tree with subclusters at the leaves, and then K-means is applied to the resulting subclusters. The resulting method, Scalable Cobweb, is then compared to a single pass K-means algorithm and standard K-means. The evaluation looks at error as measured by the sum of squared error and vulnerability to the order in which data points are processed.
机译:本文提出了一种基于两阶段过程的大型集群数据集的单一扫描算法,该过程结合了两种众所周知的聚类方法。修改了Cobweb算法,以生成叶子处带有子簇的平衡树,然后将K-means应用于生成的子簇。然后将所得方法可伸缩蜘蛛网与单遍K均值算法和标准K均值进行比较。评估着眼于误差,该误差由平方误差和对数据点处理顺序的脆弱性的总和来衡量。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号