首页> 外文会议>IEEE International Parallel and Distributed Processing Symposium Workshops >Distributed Kronecker Graph Generation with Ground Truth of Many Graph Properties
【24h】

Distributed Kronecker Graph Generation with Ground Truth of Many Graph Properties

机译:分布式的Kronecker图表生成了许多图形属性的地面真实

获取原文

摘要

Computing various global and local topological graph features is an important facet of data analysis. To do so robustly and scalably requires efficient graph algorithms that either calculate topological features exactly or approximate topological features accurately. For this reason researchers developing distributed graph analytic algorithms desire generated graph benchmarks that share the challenging characteristics of real-world graphs (small-world, scale-free, heavy-tailed degree distribution) with efficiently calculated ground truth to the desired ouput. Given two small scale-free graphs with adjacency matrices A and B, their Kronecker product graph [1] has adjacency matrix C = A ? B. Such Nonstochastic Kronecker graphs are highly compressible, and many expensive global graph calculations can be computed in sublinear time, with local graph statistics computed exactly in linear time, both from a sublinear amount of storage. Therefore, this class of graphs are likely of high interest to those pursuing data analysis tasks that incorporate diverse graph-based features. Here, we extend previous results regarding local triangle statistics and demonstrate that ground truth Kronecker formulas apply to: (i) some distance-based vertex centrality metrics (vertex eccentricity and closeness centrality), (ii) internal and external edge density of communities. Moreover, we demonstrate several scaling laws apply that allow researchers to have control over various ground truth quantities.
机译:计算各种全局和本地拓扑图功能是数据分析的重要方面。为了使如此稳健地,可扩展地需要高效的图形算法,可以准确地计算拓扑功能或近似拓扑特征。由于这个原因,研究人员开发分布式图分析算法的欲望生成了与所需输出有效计算的地面真实性的真实世界图(小世界,无缝隙,重尾层分布)的具有挑战性的曲线基准。考虑到具有邻接矩阵A和B的两个小无垢图形,它们的Kronecker产品图[1]具有邻接矩阵C = A? B.这种非转换的Kronecker图是高度可压缩的,并且可以在Sublinear Time中计算许多昂贵的全局图形计算,本地图形统计数据在线性时间内精确地计算,无论是从Sublinear的存储量都能计算。因此,这类图表可能对追求包含不同图形的特征​​的数据分析任务很高。为此,我们需要有关当地三角统计以前的结果,并表明地面实况克罗内克公式适用于:(i)一些基于距离的顶点中心度量(顶点偏心率和接近中心),(二)内部和外部社区的边缘密度。此外,我们展示了几项缩放法律,允许研究人员控制各种地面真理数量。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号