首页> 外文期刊>Data Mining and Knowledge Discovery >Using instance-level constraints in agglomerative hierarchical clustering: theoretical and empirical results
【24h】

Using instance-level constraints in agglomerative hierarchical clustering: theoretical and empirical results

机译:在聚合层次聚类中使用实例级约束:理论和经验结果

获取原文
获取原文并翻译 | 示例
           

摘要

Clustering with constraints is a powerful method that allows users to specify background knowledge and the expected cluster properties. Significant work has explored the incorporation of instance-level constraints into non-hierarchical clustering but not into hierarchical clustering algorithms. In this paper we present a formal complexity analysis of the problem and show that constraints can be used to not only improve the quality of the resultant dendrogram but also the efficiency of the algorithms. This is particularly important since many agglomerative style algorithms have running times that are quadratic (or faster growing) functions of the number of instances to be clustered. We present several bounds on the improvement in the running times of algorithms obtainable using constraints.
机译:带约束的聚类是一种功能强大的方法,它允许用户指定背景知识和预期的聚类属性。大量工作探索了将实例级约束合并到非分层群集中,而不是合并到分层群集算法中。在本文中,我们对问题进行了形式上的复杂性分析,并表明约束不仅可以用来提高生成的树状图的质量,而且可以提高算法的效率。这一点特别重要,因为许多聚集样式算法的运行时间是要群集的实例数量的二次函数(或增长速度更快)。我们提出了使用约束可获得的算法运行时间改进方面的几个界限。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号