【24h】

Active sampling for constrained clustering

机译:主动采样用于约束聚类

获取原文

摘要

Constrained Clustering is a framework of improving clustering performance by using supervised information, which is generally a set of constraints about data pairs. Since performance of constrained clustering depends on a set of constraints to use, we need a method to select good constraints that are expected to promote clustering performance. In this paper, we propose such a method, which actively select data pairs to be constrained by using variance of clustering iteration. This method consists of a bagging based cluster ensemble algorithm that integrates a set of clusters produced by a constrained k-means with random ordered data assignment. Experimental results show that our method outperforms clustering with random sampling method.
机译:约束聚类是通过使用监督信息(通常是关于数据对的一组约束)来提高聚类性能的框架。由于约束聚类的性能取决于要使用的一组约束,因此我们需要一种方法来选择预期会提高聚类性能的良好约束。在本文中,我们提出了一种通过聚类迭代的方差来主动选择要约束的数据对的方法。该方法由基于装袋的聚类集成算法组成,该算法将受约束的k均值产生的一组聚类与随机有序数据分配进行集成。实验结果表明,该方法优于随机抽样聚类算法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号