【24h】

How to partition a billion-node graph

机译:如何分割十亿个节点图

获取原文

摘要

Billion-node graphs pose significant challenges at all levels from storage infrastructures to programming models. It is critical to develop a general purpose platform for graph processing. A distributed memory system is considered a feasible platform supporting online query processing as well as offline graph analytics. In this paper, we study the problem of partitioning a billion-node graph on such a platform, an important consideration because it has direct impact on load balancing and communication overhead. It is challenging not just because the graph is large, but because we can no longer assume that the data can be organized in arbitrary ways to maximize the performance of the partitioning algorithm. Instead, the algorithm must adopt the same data and programming model adopted by the system and other applications. In this paper, we propose a multi-level label propagation (MLP) method for graph partitioning. Experimental results show that our solution can partition billion-node graphs within several hours on a distributed memory system consisting of merely several machines, and the quality of the partitions produced by our approach is comparable to state-of-the-art approaches applied on toy-size graphs.
机译:从存储基础架构到编程模型,十亿个节点的图形在各个级别上都构成了重大挑战。开发用于图形处理的通用平台至关重要。分布式内存系统被认为是支持在线查询处理以及离线图分析的可行平台。在本文中,我们研究了在这样的平台上划分十亿个节点图的问题,这是一个重要的考虑因素,因为它直接影响负载平衡和通信开销。这具有挑战性,不仅因为图很大,而且因为我们不再能够假定可以以任意方式组织数据以最大化分区算法的性能。相反,该算法必须采用系统和其他应用程序采用的相同数据和编程模型。在本文中,我们提出了一种用于图划分的多级标签传播(MLP)方法。实验结果表明,我们的解决方案可以在仅由几台机器组成的分布式存储系统中在数小时内对十亿个节点的图进行分区,并且我们的方法产生的分区质量可与应用于玩具的最新方法相媲美。尺寸的图表。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号