首页> 外文会议>IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining >Improved Triangle Counting in Graph Streams: Power of Multi-Sampling
【24h】

Improved Triangle Counting in Graph Streams: Power of Multi-Sampling

机译:在图形流中改进三角形计数:多抽样的力量

获取原文
获取外文期刊封面目录资料

摘要

Some of the well known streaming algorithms to estimate number of triangles in a graph stream work as follows: Sample a single triangle with high enough probability and repeat this basic step to obtain a global triangle count. For example, the algorithm due to Buriol et al. (PODS 2006) uniformly at random picks a single vertex v and a single edge e and checks whether the two cross edges that connect v to e appear in the stream. Similarly, the neighborhood sampling algorithm (PVLDB 2013) attempts to sample a triangle by randomly choosing a single vertex v, a single neighbor u of v and waits for a third edge that completes the triangle. In both the algorithms, the basic sampling step is repeated multiple times to obtain an estimate for the global triangle count in the input graph stream. In this work, we propose a multi-sampling variant of these algorithms: In case of Buriol et al's algorithm, instead of randomly choosing a single vertex and edge, randomly sample multiple vertices and multiple edges and collect cross edges that connect sampled vertices to the sampled edges. In case of neighborhood sampling algorithm, randomly pick multiple edges and pick multiple neighbors of them. We provide a theoretical analysis of these algorithms and prove that these new algorithms improve upon the known space and accuracy bounds. We experimentally show that these algorithms outperform well known triangle counting streaming algorithms.
机译:一些众所周知的流式流算法以估计图形流中的三角形的数量,如下所示:采样具有足够高的概率并重复该基本步骤以获得全局三角形计数的单个三角形。例如,由于BuriOl等人引起的算法。 (PODS 2006)在随机均匀地挑选单个顶点V和单个边缘E,并检查将v到e连接的两个交叉边是否出现在流中。类似地,邻域采样算法(PVLDB 2013)尝试通过随机选择单个顶点V,单个邻居u的v并等待完成三角形的第三边缘来对三角形进行采样。在两个算法中,基本采样步骤重复多次以获得输入图形流中的全局三角形计数的估计。在这项工作中,我们提出了一种这些算法的多抽样变体:在Buriol等人的算法的情况下,而不是随机选择单个顶点和边缘,随机采样多个顶点和多个边缘,并收集将采样顶点连接到的交叉边缘采样边缘。在邻域采样算法的情况下,随机选择多个边缘并选择它们的多个邻居。我们提供了对这些算法的理论分析,并证明了这些新的算法改善了已知的空间和准确性界限。我们通过实验表明这些算法优于众所周知的三角形计数流算法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号