首页> 外文会议>International joint conference on rough sets >A Sequential Three-Way Approach to Constructing a Co-association Matrix in Consensus Clustering
【24h】

A Sequential Three-Way Approach to Constructing a Co-association Matrix in Consensus Clustering

机译:共识聚类中一种顺序三向构建协关联矩阵的方法

获取原文

摘要

The main task in consensus clustering is to produce an optimal output clustering based on a set of input clusterings. The co-association matrix based consensus clustering methods are easy to understand and implement. However, they usually have high computational cost with big datasets, which restricts their applications. We propose a sequential three-way approach to constructing the co-association matrix progressively in multiple stages. In each stage, based on a set of input clusterings, we evaluate how likely two data points are associated and accordingly, divide a set of data-point pairs into three disjoint positive, negative and boundary regions. A data-point pair in the positive region is associated with a definite decision of clustering the two data points together. A pair in the negative region is associated with a definite decision of separating the two data points into different clusters. For a pair in the boundary region, we do not have sufficient information to make a definite decision. The decision on such a pair is deferred into the next stage where more input clusterings will be involved. By making quick decisions on early stages, the overall computational cost of constructing the matrix and the consensus clustering may be reduced.
机译:共识聚类的主要任务是基于一组输入聚类来产生最佳输出聚类。基于协同矩阵的共识聚类方法易于理解和实现。但是,对于大型数据集,它们通常具有很高的计算成本,这限制了它们的应用。我们提出了一种顺序三路方法来逐步构建多阶段的协同关联矩阵。在每个阶段中,基于一组输入聚类,我们评估两个数据点之间关联的可能性,因此,将一组数据点对划分为三个不相交的正,负和边界区域。正区域中的数据点对与将两个数据点聚类在一起的确定决策相关联。负区域中的一对与将两个数据点分为不同簇的确定决策相关联。对于边界区域中的一对,我们没有足够的信息来做出确定的决定。对这样的对的决定被推迟到下一阶段,在该阶段将涉及更多的输入聚类。通过在早期阶段做出快速决策,可以减少构建矩阵和共识聚类的总体计算成本。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号