【24h】

Improved Triangle Counting in Graph Streams: Power of Multi-Sampling

机译:图形流中改进的三角计数:多重采样的功能

获取原文

摘要

Some of the well known streaming algorithms to estimate number of triangles in a graph stream work as follows: Sample a single triangle with high enough probability and repeat this basic step to obtain a global triangle count. For example, the algorithm due to Buriol et al. (PODS 2006) uniformly at random picks a single vertex v and a single edge e and checks whether the two cross edges that connect v to e appear in the stream. Similarly, the neighborhood sampling algorithm (PVLDB 2013) attempts to sample a triangle by randomly choosing a single vertex v, a single neighbor u of v and waits for a third edge that completes the triangle. In both the algorithms, the basic sampling step is repeated multiple times to obtain an estimate for the global triangle count in the input graph stream. In this work, we propose a multi-sampling variant of these algorithms: In case of Buriol et al's algorithm, instead of randomly choosing a single vertex and edge, randomly sample multiple vertices and multiple edges and collect cross edges that connect sampled vertices to the sampled edges. In case of neighborhood sampling algorithm, randomly pick multiple edges and pick multiple neighbors of them. We provide a theoretical analysis of these algorithms and prove that these new algorithms improve upon the known space and accuracy bounds. We experimentally show that these algorithms outperform well known triangle counting streaming algorithms.
机译:一些用于估计图形流中三角形数量的流算法如下:以足够高的概率对单个三角形进行采样,并重复此基本步骤以获得全局三角形计数。例如,Buriol等人提出的算法。 (PODS 2006)均匀地随机选择单个顶点v和单个边e,并检查将v连接到e的两个交叉边是否出现在流中。类似地,邻域采样算法(PVLDB 2013)尝试通过随机选择单个顶点v,v的单个邻居u来采样三角形,并等待完成三角形的第三条边。在这两种算法中,基本采样步骤都会重复多次,以获得对输入图形流中全局三角形计数的估计。在这项工作中,我们提出了这些算法的多重采样变体:对于Buriol等人的算法,与其随机选择一个顶点和边,不如对多个顶点和多个边进行随机采样,并收集将采样顶点连接到顶点的交叉边缘。采样边缘。在邻域采样算法的情况下,随机选择多个边缘并选择它们的多个邻居。我们提供了对这些算法的理论分析,并证明了这些新算法在已知的空间和精度范围上有所改进。我们实验证明这些算法的性能优于众所周知的三角计数流算法。

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号