首页> 外文期刊>Computational Intelligence >DOCUMENT CLUSTERING WITH DUAL SUPERVISION THROUGH FEATURE REWEIGHTING
【24h】

DOCUMENT CLUSTERING WITH DUAL SUPERVISION THROUGH FEATURE REWEIGHTING

机译:通过功能加权实现双重监督的文档聚类

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

Traditional semi-supervised clustering uses only limited user supervision in the form of instance seeds for clusters and pairwise instance constraints to aid unsupervised clustering. However, user supervision can also be provided in alternative forms for document clustering, such as labeling a feature by indicating whether it discriminates among clusters. This article thus fills this void by enhancing traditional semi-supervised clustering with feature supervision, which asks the user to label discriminating features during defining (labeling) the instance seeds or pairwise instance constraints. Various types of semi-supervised clustering algorithms were explored with feature supervision. Our experimental results on several real-world data sets demonstrate that augmenting the instance-level supervision with feature-level supervision can significantly improve document clustering performance.
机译:传统的半监督群集仅使用实例种子形式的受限用户监视群集,并使用成对实例约束来辅助无监督群集。但是,也可以以其他形式为文档聚类提供用户监管,例如通过指示特征是否在聚类之间进行区分来标记特征。因此,本文通过使用功能监督增强了传统的半监督聚类来填补了这一空白,该功能要求用户在定义(标记)实例种子或成对实例约束时标记区别的功能。通过特征监督探索了各种类型的半监督聚类算法。我们在多个实际数据集上的实验结果表明,使用功能级别的监督来增强实例级别的监督可以显着提高文档聚类的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号