首页> 外文期刊>The VLDB journal >Efficient structural node similarity computation on billion-scale graphs
【24h】

Efficient structural node similarity computation on billion-scale graphs

机译:有效的结构节点相似性计算亿尺度图形

获取原文
获取原文并翻译 | 示例
       

摘要

Structural node similarity is widely used in analyzing complex networks. As one of the structural node similarity metrics, role similarity has the good merit of indicating automorphism (isomorphism). Existing algorithms to compute role similarity (e.g., RoleSim and NED) suffer from severe performance bottlenecks and thus cannot handle large real-world graphs. In this paper, we propose a new framework, namely StructSim, to compute nodes' role similarity. Under this framework, we first prove that StructSim is an admissible role similarity metric based on the maximum matching. While the maximum matching is still too costly to scale, we then devise the BinCount matching that not only is efficient to compute but also guarantees the admissibility of StructSim. BinCount-based StructSim admits a precomputed index to query a single pair of node in O(k log D) time, where k is a small user-defined parameter and D is the maximum node degree. To build the index, we further devise an FM-sketch-based technique that can handle graphs with billions of edges. Extensive empirical studies show that StructSim performs much better than the existing works regarding both effectiveness and efficiency when applied to compute structural node similarities on the real-world graphs.
机译:结构节点相似度广泛用于分析复杂网络。作为结构节点相似度量之一,角色相似性具有指示自同构型(同构)的良好优点。计算角色相似性的现有算法(例如,横向和NED)遭受严重的性能瓶颈,因此无法处理大型真实图。在本文中,我们提出了一个新的框架,即结构范围,计算节点的角色相似度。在此框架下,我们首先证明了结构范围是基于最大匹配的可允许的角色相似度指标。虽然规模的最大匹配仍然太昂贵,但我们设计的Bincount匹配不仅有效地计算,而且保证了结构的可否受理。基于Bincount的Structsim承认预先计算的索引在O(k log d)时间内查询单对节点,其中k是小用户定义的参数,d是最大节点度。为了构建索引,我们进一步设计了一种基于FM-Sketch的技术,可以处理数十亿边缘的图形。广泛的实证研究表明,当应用于计算真实世界图中的结构节点相似性时,STRACHSIM比现有的有效性和效率的工作更好。

著录项

  • 来源
    《The VLDB journal》 |2021年第3期|471-493|共23页
  • 作者单位

    Univ New South Wales Sydney NSW Australia;

    Univ New South Wales Sydney NSW Australia|Alibaba Grp Hangzhou Peoples R China;

    Univ Technol Sydney Ctr AI Sydney NSW Australia;

    Univ New South Wales Sydney NSW Australia|East China Normal Univ Shanghai Peoples R China;

  • 收录信息 美国《科学引文索引》(SCI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Node similarity; Role similarity; Efficiency; Link analysis;

    机译:节点相似度;角色相似性;效率;链接分析;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号