首页> 外文会议>European Conference on Principles and Practice of Knowledge Discovery in Databases(PKDD 2005); 20051003-07; Porto(PT) >Agglomerative Hierarchical Clustering with Constraints: Theoretical and Empirical Results
【24h】

Agglomerative Hierarchical Clustering with Constraints: Theoretical and Empirical Results

机译:带约束的聚集层次聚类:理论和经验结果

获取原文
获取原文并翻译 | 示例

摘要

We explore the use of instance and cluster-level constraints with ag-glomerative hierarchical clustering. Though previous work has illustrated the benefits of using constraints for non-hierarchical clustering, their application to hierarchical clustering is not straight-forward for two primary reasons. First, some constraint combinations make the feasibility problem (Does there exist a single feasible solution?) NP-complete. Second, some constraint combinations when used with traditional agglomerative algorithms can cause the dendrogram to stop prematurely in a dead-end solution even though there exist other feasible solutions with a significantly smaller number of clusters. When constraints lead to efficiently solvable feasibility problems and standard agglomerative algorithms do not give rise to dead-end solutions, we empirically illustrate the benefits of using constraints to improve cluster purity and average distortion. Furthermore, we introduce the new γ constraint and use it in conjunction with the triangle inequality to considerably improve the efficiency of agglomerative clustering.
机译:我们探索了实例和群集级别约束与团聚层次聚类的使用。尽管先前的工作已经说明了将约束用于非分层聚类的好处,但是由于两个主要原因,它们在分层聚类中的应用并不简单。首先,一些约束组合使可行性问题(是否存在一个可行的解决方案?)NP完全。第二,即使存在其他可行的具有明显更少簇的可行解决方案,当与传统的凝聚算法一起使用时,某些约束组合也可能导致树状图在死胡同中过早停止。当约束导致有效解决的可行性问题并且标准的凝聚算法没有产生死胡同的解决方案时,我们将通过经验说明使用约束来提高群集纯度和平均失真的好处。此外,我们引入了新的γ约束,并将其与三角不等式结合使用,以大大提高聚集聚类的效率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号