首页> 外文会议>Knowledge Discovery and Data Mining, 2010. WKDD '10 >Hierarchical Agglomerative Clustering with Ordering Constraints
【24h】

Hierarchical Agglomerative Clustering with Ordering Constraints

机译:具有排序约束的分层聚集聚类

获取原文

摘要

Many previous researchers have converted background knowledge as constraints to obtain accurate clustering. These clustering methods are usually called constrained clustering. Previous ordering constraints are instance level non-hierarchical constraints, like must-link and cannot-link constraints, which do not provide hierarchical information. In order to incorporate the hierarchical background knowledge into agglomerative clustering, we extend instance-level constraint to hierarchical constraint in this paper. We name it as ordering constraint. Ordering constraints can be used to capture hierarchical side information and they allow the user to encode hierarchical knowledge such as ontologies into agglomerative algorithms. We experimented with ordering constraints on labeled newsgroup data. Experiments showed that the dendrogram generated by ordering constraints is more similar to the pre-known hierarchy than the dendrogram generated by previous agglomerative clustering algorithms. We believe this work will have a significant impact on the agglomerative clustering field.
机译:许多以前的研究人员已经将背景知识转换为约束,以获取准确的聚类。这些聚类方法通常称为约束聚类。先前的排序约束是实例级别的非层次约束,例如必须链接和不能链接约束,它们不提供层次信息。为了将层次背景知识整合到聚集聚类中,我们将实例级约束扩展为层次约束。我们将其命名为排序约束。排序约束可用于捕获分层的辅助信息,它们使用户可以将诸如本体之类的分层知识编码为凝聚算法。我们对带有标签的新闻组数据进行了排序约束实验。实验表明,通过排序约束生成的树状图比以前的聚集聚类算法生成的树状图更类似于已知的层次结构。我们相信这项工作将对凝聚集群领域产生重大影响。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号