...
首页> 外文期刊>Information Fusion >k-ANMI: A mutual information based clustering algorithm for categorical data
【24h】

k-ANMI: A mutual information based clustering algorithm for categorical data

机译:k-ANMI:基于互信息的分类数据聚类算法

获取原文
获取原文并翻译 | 示例

摘要

Clustering categorical data is an integral part of data mining and has attracted much attention recently. In this paper, we present k-ANMI, a new efficient algorithm for clustering categorical data. The k-ANMI algorithm works in a way that is similar to the popular k-means algorithm, and the goodness of clustering in each step is evaluated using a mutual information based criterion (namely, average normalized mutual information - ANMI) borrowed from cluster ensemble. This algorithm is easy to implement, requiring multiple hash tables as the only major data structure. Experimental results on real datasets show that k-ANMI algorithm is competitive with those state-of-the-art categorical data clustering algorithms with respect to clustering accuracy.
机译:聚类分类数据是数据挖掘不可或缺的一部分,最近引起了很多关注。在本文中,我们提出了k-ANMI,这是一种用于分类数据聚类的新有效算法。 k-ANMI算法的工作方式与流行的k-means算法类似,并且使用从聚类集成中借用的基于互信息的标准(即平均归一化互信息-ANMI)来评估每个步骤中的聚类的优缺点。 。该算法易于实现,需要多个哈希表作为唯一的主要数据结构。在真实数据集上的实验结果表明,就聚类准确性而言,k-ANMI算法与那些最新的分类数据聚类算法相比具有竞争优势。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号