...
首页> 外文期刊>IEEE Transactions on Fuzzy Systems >Semisupervised Fuzzy Clustering With Partition Information of Subsets
【24h】

Semisupervised Fuzzy Clustering With Partition Information of Subsets

机译:具有子集分区信息的半监督模糊聚类

获取原文
获取原文并翻译 | 示例

摘要

Pairwise constraint is a type of side information that is widely considered in existing semisupervised clustering approaches. In this paper, we explore a new form of supervision for clustering. We consider the partition results of a number of subsets as additional information to assist clustering. Compared to the pairwise constraint, which only involves the "must-link" or "cannot-link" relationship of two objects, the partition of a subset of objects provides information about the group structure of more objects and hence can possibly serve as a more effective form of supervision for clustering. In this paper, we instantiate the idea of clustering with subset partitions under the fuzzy clustering framework for document categorization. The proposed fuzzy clustering approach is formulated to learn from the partition of subsets and has the ability to handle high-dimensional document data. Specifically, the partition results of subsets are collectively transformed into pairwise relationships, based on which a penalty term is constructed and incorporated into a cosine-distance-based fuzzy c-means approach. The experimental results on benchmark data sets demonstrate the effectiveness of the proposed approach for a semisupervised document clustering.
机译:成对约束是在现有的半监督聚类方法中广泛考虑的一种辅助信息。在本文中,我们探索了一种新的集群监管形式。我们将许多子集的分区结果视为辅助聚类的附加信息。与仅涉及两个对象的“必须链接”或“不能链接”关系的成对约束相比,对象子集的分区提供了有关更多对象的组结构的信息,因此可能充当更多对象的组结构。集群监督的有效形式。在本文中,我们实例化了在模糊聚类框架下使用子集分区进行聚类的思想,用于文档分类。拟定的模糊聚类方法旨在从子集的划分中学习,并具有处理高维文档数据的能力。具体而言,将子集的划分结果集中转换为成对关系,基于此关系构造惩罚项并将其纳入基于余弦距离的模糊c均值方法。在基准数据集上的实验结果证明了该方法在半监督文档聚类中的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号