首页> 外文会议>ACM international conference on information and knowledge management >Scalable Clustering of Signed Networks Using Balance Normalized Cut
【24h】

Scalable Clustering of Signed Networks Using Balance Normalized Cut

机译:使用余额归一化割的签名网络可扩展群集

获取原文

摘要

We consider the general k-way clustering problem in signed social networks where relationships between entities can be either positive or negative. Motivated by social balance theory, the clustering problem in signed networks aims to find mutually antagonistic groups such that entities within the same group are friends with each other. A recent method proposed in [13] extended the spectral clustering algorithm to the signed network setting by considering the signed graph Laplacian. This has been shown to be equivalent to finding clusters that minimize the 2-way signed ratio cut. In this paper, we show that there is a fundamental weakness when we directly extend the signed Laplacian to the fc-way clustering problem. To overcome this weakness, we formulate new fc-way objectives for signed networks. In particular, we propose a criterion that is analogous to the normalized cut, called balance normalized cut, which is not only theoretically sound but also experimentally effective in fc-way clustering. In addition, we prove that these objectives are equivalent to weighted kernel fc-means objectives by choosing an appropriate kernel matrix. Employing this equivalence, we develop a multilevel clustering framework for signed networks. In this framework, we coarsen the graph level by level and refine the clustering results at each level via a fc-means based algorithm so that the signed clustering objectives are optimized. This approach gives good quality clustering results, and is also highly efficient and scalable. In experiments, we see that our multilevel approach is competitive to other state-of-the-art methods, while it is much faster and more scalable. In particular, the largest graph we have considered in our experiments contains 1 million nodes and 100 million edges - this graph can be clustered in less than four hundred seconds using our algorithm.
机译:我们考虑签名社会网络中的一般k路径聚类问题,其中实体之间的关系可以是积极的或消极的。受社会平衡理论的推动,签名网络中的聚类问题旨在找到相互对立的群体,以使同一群体内的实体成为彼此的朋友。文献[13]中提出的一种新方法是通过考虑有符号图拉普拉斯算子将频谱聚类算法扩展到有符号网络设置。已经证明,这等效于找到最小化2向有符号比率削减的聚类。在本文中,我们证明了当将带符号的拉普拉斯算子直接扩展到fc-way聚类问题时,存在一个根本性的弱点。为了克服这一弱点,我们为签名网络制定了新的fc-way目标。特别是,我们提出了一个类似于归一化割的标准,称为平衡归一化割,它不仅在理论上是合理的,而且在fc-way聚类中在实验上是有效的。另外,通过选择适当的内核矩阵,我们证明了这些目标等同于加权内核fc-means目标。利用这种等效性,我们为签名网络开发了一个多层次的聚类框架。在此框架中,我们逐级对图形进行粗化,并通过基于fc-means的算法细化每个级别的聚类结果,从而优化了带符号的聚类目标。这种方法可提供高质量的聚类结果,并且效率高且可扩展。在实验中,我们看到我们的多级方法与其他最新方法相比具有竞争优势,同时它的速度更快且可扩展性更高。特别是,我们在实验中考虑的最大图包含100万个节点和1亿条边-使用我们的算法,该图可以在不到四百秒的时间内聚类。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号