...
【24h】

Active Sampling for Constrained Clustering

机译:约束聚类的主动采样

获取原文
获取原文并翻译 | 示例

摘要

Constrained clustering is a framework for improving clustering performance by using constraints about data pairs. Since performance of constrained clustering depends on the set of constraints used, a method is needed to select good constraints that promote clustering performance. In this paper, we propose an active sampling method working with a constrained cluster ensemble algorithm that aggregates clustering results that a modified COP-Kmeans iteratively produces by changing the priorities of constraints. Our method follows the approach of uncertainty sampling and measures uncertainty using variations of clustering results where data pairs are clustered together in some results but not in others. It selects the data pair to be labeled that has the most variable result during cluster ensemble process. Experimental results show that our method outperforms random sampling. We further investigate the effect of important parameters.
机译:约束聚类是通过使用有关数据对的约束来提高聚类性能的框架。由于约束聚类的性能取决于所使用的约束集,因此需要一种方法来选择能够提高聚类性能的良好约束。在本文中,我们提出了一种与约束聚类集成算法一起使用的主动采样方法,该算法聚合了通过更改约束的优先级而迭代生成的改进的COP-Kmeans聚类结果。我们的方法遵循不确定性抽样的方法,并使用聚类结果的变化来测量不确定性,其中数据对在某些结果中聚类在一起,而在其他结果中不聚类。它选择在集群集成过程中结果变化最大的要标记的数据对。实验结果表明,我们的方法优于随机抽样。我们进一步研究重要参数的影响。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号