首页> 外文会议>International conference on very large data bases >Scalable Subgraph Enumeration in MapReduce
【24h】

Scalable Subgraph Enumeration in MapReduce

机译:MapReduce中的可伸缩子图枚举

获取原文

摘要

Subgraph enumeration, which aims to find all the subgraphs of a large data graph that are isomorphic to a given pattern graph, is a fundamental graph problem with a wide range of applications. However, existing sequential algorithms for subgraph enumeration fall short in handling large graphs due to the involvement of computationally intensive subgraph isomorphism operations. Thus, some recent researches focus on solving the problem using MapReduce. Nevertheless, exiting MapReduce approaches are not scalable to handle very large graphs since they either produce a huge number of partial results or consume a large amount of memory. Motivated by this, in this paper, we propose a new algorithm TwinTwigJoin based on a left-deep-join framework in MapReduce, in which the basic join unit is a TwinTwig (an edge or two incident edges of a node). We show that in the Erdos-Renyi random-graph model, TwinTwigJoin is instance optimal in the left-deep-join framework under reasonable assumptions, and we devise an algorithm to compute the optimal join plan. Three optimization strategies are explored to improve our algorithm. Furthermore, we discuss how our approach can be adapted in the power-law random-graph model.We conduct extensive performance studies in several real graphs, one of which contains billions of edges. Our approach significantly outperforms existing solutions in all tests.
机译:子图枚举旨在查找与给定模式图同构的大型数据图的所有子图,这是一个具有广泛应用范围的基本图问题。但是,由于涉及计算量大的子图同构运算,现有的子图枚举顺序算法在处理大型图时不够。因此,最近的一些研究集中在使用MapReduce解决问题上。但是,现有的MapReduce方法无法扩展以处理非常大的图形,因为它们要么产生大量的部分结果,要么消耗大量的内存。为此,本文提出了一种基于MapReduce中左深连接框架的新算法TwinTwigJoin,其中基本连接单元为TwinTwig(节点的一条边或两个入射边)。我们证明在Erdos-Renyi随机图模型中,在合理的假设下,TwinTwigJoin在左深联接框架中是实例最优的,并且设计了一种算法来计算最优联接计划。探索了三种优化策略来改进我们的算法。此外,我们讨论了如何在幂律随机图模型中调整我们的方法。我们在几个真实的图中进行了广泛的性能研究,其中一个包含数十亿条边。我们的方法在所有测试中均明显优于现有解决方案。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号