Semi-supervised Clustering of Graph Objects: A Subgraph Mining Approach

Abstract

Semi-supervised clustering, which aims to improve clustering performance with limited supervision, has recently received a lot of attention in the literature. Most existing semi-supervised clustering studies assume that the data is represented in a vector space, e.g., text and relational data. When the data objects have complex structures, e.g., proteins and chemical compounds, those semi-supervised clustering methods are not directly applicable to clustering such graph objects. In this paper, we study the problem of semi-supervised clustering of data objects which are represented as graphs. The supervision information is in the form of pairwise constraints of must-links and cannot-links. As there is no predefined feature set for the graph objects, we propose to use discriminative subgraph patterns as the features. We design an objective function which incorporates the constraints to guide the subgraph feature mining and selection process. We derive an upper bound of the objective function, based on which a branch-and-bound algorithm is proposed to speed up subgraph mining. We also introduce a redundancy measure into the feature selection process in order to reduce the redundancy in the feature set. Once the graph objects are represented in the vector space of the discriminative subgraph features, we use semi-supervised kernel K-means to cluster all graph objects. Experimental results on real-world protein datasets demonstrate that the constraint information can effectively guide the feature selection and clustering process and achieve satisfactory clustering performance.
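The abstract does not state the objective function or the redundancy measure, so the following is only a minimal sketch of the general idea of constraint-guided feature selection, not the authors' method. It assumes the candidate subgraphs have already been mined and that each one is given as a binary occurrence vector over the graph objects. The scoring rule (count satisfied must-links and cannot-links), the Jaccard-based redundancy penalty, and all names (`constraint_score`, `select_features`, `penalty`) are hypothetical choices made for illustration. Subgraph mining, the upper bound, and branch-and-bound pruning are not shown.

```python
def constraint_score(occurrence, must_links, cannot_links):
    """Count the pairwise constraints that one binary feature satisfies.

    Assumed scoring rule (not from the paper): a must-link pair is satisfied
    when both graphs agree on the feature, a cannot-link pair when they differ.
    """
    agree = sum(1 for i, j in must_links if occurrence[i] == occurrence[j])
    split = sum(1 for i, j in cannot_links if occurrence[i] != occurrence[j])
    return agree + split


def jaccard(f, g):
    """Jaccard overlap of two occurrence vectors, used here as a stand-in redundancy measure."""
    both = sum(1 for a, b in zip(f, g) if a and b)
    either = sum(1 for a, b in zip(f, g) if a or b)
    return both / either if either else 0.0


def select_features(features, must_links, cannot_links, k, penalty=1.0):
    """Greedily pick up to k features, trading constraint score against redundancy."""
    selected = []
    remaining = set(features)
    while remaining and len(selected) < k:
        def gain(name):
            score = constraint_score(features[name], must_links, cannot_links)
            # Redundancy: highest overlap with any feature already chosen.
            overlap = max(
                (jaccard(features[name], features[s]) for s in selected),
                default=0.0,
            )
            return score - penalty * overlap

        # sorted() makes tie-breaking deterministic (alphabetical order).
        best = max(sorted(remaining), key=gain)
        selected.append(best)
        remaining.remove(best)
    return selected


# Toy data: four graph objects, four hypothetical subgraph features.
features = {
    "A": [1, 1, 0, 0],
    "B": [1, 1, 0, 0],  # exact duplicate of A, so fully redundant
    "C": [1, 0, 0, 0],
    "D": [1, 1, 1, 1],
}
must_links = [(0, 1), (2, 3)]
cannot_links = [(0, 2)]

chosen = select_features(features, must_links, cannot_links, k=2, penalty=3.0)
print(chosen)
```

In this toy setting the duplicate feature `B` scores as well as `A` on the constraints alone, and it is the redundancy penalty that keeps it out of the selected set. The selected columns could then serve as the vector representation passed to a clustering step.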

Bibliographic record

Source
Database Systems for Advanced Applications, Part 1 | 2012 | pp. 197-212 | 16 pages
Conference location: Busan (KR)
Authors
Xin Huang; Hong Cheng; Jiong Yang; Jeffery Xu Yu; Hongliang Fei; Jun Huan
Author affiliations

The Chinese University of Hong Kong;

The Chinese University of Hong Kong;

Case Western Reserve University;

The Chinese University of Hong Kong;

The Chinese University of Hong Kong;

University of Kansas;

Original format: PDF
Language: English
Chinese Library Classification: TP311.13
Keywords
semi-supervised clustering; frequent subgraph mining;

Similar literature

1. A truss-based approach for densest homogeneous subgraph mining in node-attributed graphs [J]. Sun Heli, Zhang Yawei, Jia Xiaolin, Computational Intelligence. 2021, No. 2
2. An Approach to Semi-Supervised Co-Clustering With Side Information in Text Mining [J]. Rucha Bhutada, D. A. Borikar, International Journal of Engineering Trends and Technology. 2015, No. 6
3. GAMer: a synthesis of subspace clustering and dense subgraph mining [J]. Stephan Gunnemann, Ines Farber, Brigitte Boden, Knowledge and Information Systems. 2014, No. 2
4. Efficient Mining of Combined Subspace and Subgraph Clusters in Graphs with Feature Vectors [C]. Stephan Guennemann, Brigitte Boden, Ines Faerber, Advances in Knowledge Discovery and Data Mining. 2013
5. A relational database approach for frequent subgraph mining [D]. Pradhan, Subhesh Kumar. 2006
6. RASMA: a reverse search algorithm for mining maximal frequent subgraphs [O]. Saeed Salem, Mohammed Alokshiya, Mohammad Al Hasan. 2021
7. Semi-supervised Clustering of Graph Objects: A Subgraph Mining Approach [O]. Xin Huang, Hong Cheng, Jiong Yang, 2012
