首页> 外文会议>International Conference for High Performance Computing, Networking, Storage and Analysis >Efficient data partitioning model for heterogeneous graphs in the cloud
【24h】

Efficient data partitioning model for heterogeneous graphs in the cloud

机译:云中异构图的高效数据分区模型

获取原文

摘要

As the size and variety of information networks continue to grow in many scientific and engineering domains, we witness a growing demand for efficient processing of large heterogeneous graphs using a cluster of compute nodes in the Cloud. One open issue is how to effectively partition a large graph to process complex graph operations efficiently. In this paper, we present VB-Partitioner - a distributed data partitioning model and algorithms for efficient processing of graph operations over large-scale graphs in the Cloud. Our VB-Partitioner has three salient features. First, it introduces vertex blocks (VBs) and extended vertex blocks (EVBs) as the building blocks for semantic partitioning of large graphs. Second, VB-Partitioner utilizes vertex block grouping algorithms to place those vertex blocks that have high correlation in graph structure into the same partition. Third, VB-Partitioner employs a VB-partition guided query partitioning model to speed up the parallel processing of graph pattern queries by reducing the amount of inter-partition query processing. We conduct extensive experiments on several real-world graphs with millions of vertices and billions of edges. Our results show that VB-Partitioner significantly outperforms the popular random block-based data partitioner in terms of query latency and scalability over large-scale graphs.
机译:随着信息网络的大小和各种在许多科学和工程域中继续增长,我们目睹了使用云中的计算节点的群集有效地处理大型异构图。一个开放问题是如何有效地分区大图以有效地处理复杂的图形操作。在本文中,我们呈现了VB-Partitioner - 一种分布式数据分区模型和算法,以便在云中的大规模图中有效地处理图形操作。我们的VB-Partitioner具有三个突出功能。首先,它引入顶点块(VBS)和扩展顶点块(EVB)作为大图的语义分区的构建块。其次,VB-Partitioner利用顶点块分组算法将具有高相关的顶点块放入相同的分区中。第三,VB-Partitioner采用VB分区引导查询分区模型,通过减少分区间查询处理的量来加速图形模式查询的并行处理。我们对几个具有数百万个顶点和数十亿边缘的几个真实图表进行了广泛的实验。我们的结果表明,VB-Partitioner在大规模图表中的查询延迟和可扩展性方面显着优于基于流行的随机块的数据分区。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号