首页> 外文会议>International conference on management of data >Local Graph Sparsification for Scalable Clustering
【24h】

Local Graph Sparsification for Scalable Clustering

机译:可伸缩聚类的本地图形稀疏化

获取原文

摘要

In this paper we look at how to sparsify a graph i.e. how to reduce the edgeset while keeping the nodes intact, so as to enable faster graph clustering without sacrificing quality. The main idea behind our approach is to preferentially retain the edges that are likely to be part of the same cluster. We propose to rank edges using a simple similarity-based heuristic that we efficiently compute by comparing the minhash signatures of the nodes incident to the edge. For each node, we select the top few edges to be retained in the sparsified graph. Extensive empirical results on several real networks and using four state-of-the-art graph clustering and community discovery algorithms reveal that our proposed approach realizes excellent speedups (often in the range 10-50), with little or no deterioration in the quality of the resulting clusters. In fact, for at least two of the four clustering algorithms, our sparsification consistently enables higher clustering accuracies.
机译:在本文中,我们看看如何缩小图形,即如何减少边缘,同时保持节点完好无损,以便在不牺牲质量的情况下实现更快的图形聚类。我们背后的主要思想是优先保留可能成为同一群集的一部分的边缘。我们建议使用基于简单的相似性的启发式来排序边缘,以通过比较事件到边缘的节点的minhash签名来有效地计算。对于每个节点,我们选择要保留在稀疏图中的顶部几个边。在几个真实网络和使用四个最先进的图形聚类和社区发现算法上的广泛的经验结果表明,我们的提出方法实现了优异的加速(通常在10-50范围内),少或根本没有恶化由此产生的簇。事实上,对于四种聚类算法中的至少两个,我们的稀疏始终能够实现更高的聚类精度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号