首页> 外文会议>IEEE international conference on data engineering >Robust clustering of multi-type relational data via a heterogeneous manifold ensemble
【24h】

Robust clustering of multi-type relational data via a heterogeneous manifold ensemble

机译:通过异构歧管集合鲁棒聚类多型关系数据

获取原文

摘要

High-Order Co-Clustering (HOCC) methods have attracted high attention in recent years because of their ability to cluster multiple types of objects simultaneously using all available information. During the clustering process, HOCC methods exploit object co-occurrence information, i.e., inter-type relationships amongst different types of objects as well as object affinity information, i.e., intra-type relationships amongst the same types of objects. However, it is difficult to learn accurate intra-type relationships in the presence of noise and outliers. Existing HOCC methods consider the p nearest neighbours based on Euclidean distance for the intra-type relationships, which leads to incomplete and inaccurate intra-type relationships. In this paper, we propose a novel HOCC method that incorporates multiple subspace learning with a heterogeneous manifold ensemble to learn complete and accurate intra-type relationships. Multiple subspace learning reconstructs the similarity between any pair of objects that belong to the same subspace. The heterogeneous manifold ensemble is created based on two-types of intra-type relationships learnt using p-nearest-neighbour graph and multiple subspaces learning. Moreover, in order to make sure the robustness of clustering process, we introduce a sparse error matrix into matrix decomposition and develop a novel iterative algorithm. Empirical experiments show that the proposed method achieves improved results over the state-of-art HOCC methods for FScore and NMI.
机译:近年来,高阶共聚类(HOCC)方法由于能够使用所有可用信息而群集多种类型的对象,因此近年来引起了很高的关注。在聚类过程中,HOCC方法利用不同类型对象之间的对象共发生信息,即类型关系,以及对象亲和信息,即相同类型的对象之间的类型关系。但是,很难在存在噪声和异常值的情况下学习准确的内部关系。现有的HOCC方法认为基于欧几里德距离的P最终邻居进行型号的关系,这导致内含内关系不完整和不准确。在本文中,我们提出了一种新的HOCC方法,该方法包含多个子空间学习,具有异质流形集合,以学习完整和准确的内部关系。多个子空间学习重建属于同一子空间的任何对象之间的相似性。基于使用P-Collect邻图和多个子空间学习学习的两种类型的类型的关系来创建异构歧管组合。此外,为了确保聚类过程的稳健性,我们将稀疏的误差矩阵引入矩阵分解并开发一种新颖的迭代算法。实证实验表明,该方法达到了对FScore和NMI的最先进的HOCC方法改善了结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号