首页> 外文会议>Big data >Bisimulation Reduction of Big Graphs on MapReduce
【24h】

Bisimulation Reduction of Big Graphs on MapReduce

机译:MapReduce上大图的双仿真约简

获取原文
获取原文并翻译 | 示例

摘要

Computing the bisimulation partition of a graph is a fundamental problem which plays a key role in a wide range of basic applications. Intuitively, two nodes in a graph are bisimilar if they share basic structural properties such as labeling and neighborhood topology. In data management, reducing a graph under bisimulation equivalence is a crucial step, e.g., for indexing the graph for efficient query processing. Often, graphs of interest in the real world are massive; examples include social networks and linked open data. For analytics on such graphs, it is becoming increasingly infeasible to rely on in-memory or even I/O-efficient solutions. Hence, a trend in Big Data analytics is the use of distributed computing frameworks such as MapReduce. While there are both internal and external memory solutions for efficiently computing bisimulation, there is, to our knowledge, no effective MapReduce-based solution for bisimulation. Motivated by these observations we propose in this paper the first efficient MapReduce-based algorithm for computing the bisimulation partition of massive graphs. We also detail several optimizations for handling the data skew which often arises in real-world graphs. The results of an extensive empirical study are presented which demonstrate the effectiveness and scalability of our solution.
机译:计算图的双仿真分区是一个基本问题,它在广泛的基本应用中起着关键作用。直观地,图中的两个节点如果共享基本的结构属性(例如标记和邻域拓扑),则是双相似的。在数据管理中,在双模拟等效条件下减少图形是至关重要的步骤,例如,为索引图形以进行有效的查询处理。通常,现实世界中的关注图非常庞大;例子包括社交网络和链接的开放数据。对于此类图形的分析,依靠内存或什至I / O效率高的解决方案变得越来越不可行。因此,大数据分析的趋势是使用分布式计算框架(例如MapReduce)。虽然有内部和外部内存解决方案都可以有效地计算双仿真,但据我们所知,没有有效的基于MapReduce的双仿真解决方案。基于这些观察,我们在本文中提出了第一种基于有效的MapReduce的算法,用于计算大规模图的双仿真分区。我们还详细介绍了几种优化方法,用于处理现实世界图中经常出现的数据偏斜。提出了广泛的经验研究结果,这些结果证明了我们解决方案的有效性和可扩展性。

著录项

  • 来源
    《Big data》|2013年|189-203|共15页
  • 会议地点 Oxford(GB)
  • 作者单位

    Eindhoven University of Technology, The Netherlands;

    Eindhoven University of Technology, The Netherlands;

    Eindhoven University of Technology, The Netherlands;

    Eindhoven University of Technology, The Netherlands;

    Delft University of Technology, The Netherlands;

    Indiana University, Bloomington, USA;

  • 会议组织
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

  • 入库时间 2022-08-26 14:28:20

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号