...
首页> 外文期刊>Systems, Man, and Cybernetics, Part B: Cybernetics, IEEE Transactions on >CA-Tree: A Hierarchical Structure for Efficient and Scalable Coassociation-Based Cluster Ensembles
【24h】

CA-Tree: A Hierarchical Structure for Efficient and Scalable Coassociation-Based Cluster Ensembles

机译:CA树:高效和可伸缩的基于协作的集群集成的层次结构

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Cluster ensembles have attracted a lot of research interests in recent years, and their applications continue to expand. Among the various algorithms for cluster ensembles, those based on coassociation matrices are probably the ones studied and used the most because coassociation matrices are easy to understand and implement. However, the main limitation of coassociation matrices as the data structure for combining multiple clusterings is the complexity that is at least quadratic to the number of patterns $N$. In this paper, we propose CA-tree, which is a dendogram-like hierarchical data structure, to facilitate efficient and scalable cluster ensembles for coassociation-matrix-based algorithms. All the properties of the CA-tree are derived from base cluster labels and do not require the access to the original data features. We then apply a threshold to the CA-tree to obtain a set of nodes, which are then used in place of the original patterns for ensemble-clustering algorithms. The experiments demonstrate that the complexity for coassociation-based cluster ensembles can be reduced to close to linear to $N$ with minimal loss on clustering accuracy.
机译:近年来,团簇合奏吸引了许多研究兴趣,并且它们的应用不断扩展。在各种集群集成算法中,基于协关联矩阵的算法可能是最受研究和使用的算法,因为协关联矩阵易于理解和实现。然而,作为用于组合多个聚类的数据结构的协关联矩阵的主要局限性是复杂度,该复杂度至少是模式$ N $的数量的二次方。在本文中,我们提出了CA树,这是一种类似于树状图的分​​层数据结构,以促进基于协关联矩阵的算法的高效且可扩展的集群集成。 CA树的所有属性都是从基本群集标签派生的,不需要访问原始数据功能。然后,我们将阈值应用于CA树以获取一组节点,然后将这些节点代替原始模式用于集合聚类算法。实验表明,基于协作的聚类集成的复杂度可以降低到接近线性到$ N $,并且聚类精度损失最小。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号