首页> 外文会议>International conference on advanced data mining and applications >Topic-Constrained Hierarchical Clustering for Document Datasets
【24h】

Topic-Constrained Hierarchical Clustering for Document Datasets

机译:文档数据集的主题约束分层群集

获取原文

摘要

In this paper, we propose the topic-constrained hierarchical clustering, which organizes document datasets into hierarchical trees con-sistant with a given set of topics. The proposed algorithm is based on a constrained agglomerative clustering framework and a semi-supervised criterion function that emphasizes the relationship between documents and topics and the relationship among documents themselves simultaneously. The experimental evaluation show that our algorithm outperformed the traditional agglomerative algorithm by 7.8% to 11.4%.
机译:在本文中,我们提出了主题约束的分层群集,该聚类将文档数据集组织成具有给定主题集的分层树Con-sistant。所提出的算法基于受约束的聚类聚类框架和半监督标准函数,它强调了文档与主题之间的关系以及同时文档本身之间的关系。实验评估表明,我们的算法优于传统的凝聚算法的表现优于7.8%至11.4%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号