首页> 外文会议>International conference on very large data bases >LogGP: A Log-based Dynamic Graph Partitioning Method
【24h】

LogGP: A Log-based Dynamic Graph Partitioning Method

机译:LogGP:一种基于日志的动态图分区方法

获取原文

摘要

With the increasing availability and scale of graph data from Web 2.0, graph partitioning becomes one of efficient preprocessing techniques to balance the computing workload. Since the cost of partitioning the entire graph is strictly prohibitive, there are some recent tentative works towards streaming graph partitioning which can run faster, be easily paralleled, and be incrementally updated. Unfortunately, the experiments show that the running time of each partitioning is still unbalanced due to the variation of workload access pattens during the supersteps. In addition, the one-pass streaming partitioning result is not always satisfactory for the algorithms' local view of the graph. In this paper, we present LogGP, a log-based graph partitioning system that records, analyzes and reuses the historical statistical information to refine the partitioning result. LogGP can be used as a middle-ware and deployed to many state-of-the-art paralleled graph processing systems easily. LogGP utilizes the historical partitioning results to generate a hyper-graph and uses a novel hyper-graph streaming partitioning approach to generate a better initial streaming graph partitioning result. During the execution, the system uses running logs to optimize graph partitioning which prevents performance degradation. Moreover, LogGP can dynamically repartition the massive graphs in accordance with the structural changes. Extensive experiments conducted on a moderate size of computing cluster with real-world graph datasets demonstrate the superiority of our approach against the state-of-the-art solutions.
机译:随着Web 2.0中图形数据的可用性和规模的不断增长,图形分区已成为平衡计算工作量的有效预处理技术之一。由于对整个图进行分区的成本严格禁止,因此最近有一些尝试性的工作可以对流图进行分区,它们可以运行得更快,易于并行化并进行增量更新。不幸的是,实验表明,由于超级步骤中工作负载访问模式的变化,每个分区的运行时间仍然不平衡。此外,对于算法的图形局部视图,单次流式分割结果并不总是令人满意。在本文中,我们介绍LogGP,这是一个基于日志的图分区系统,可记录,分析和重用历史统计信息以细化分区结果。 LogGP可以用作中间件,并且可以轻松地部署到许多最新的并行图形处理系统中。 LogGP利用历史分区结果生成超图,并使用新颖的超图流分区方法生成更好的初始流图分区结果。在执行期间,系统使用运行日志来优化图分区,从而防止性能下降。此外,LogGP可以根据结构变化动态重新划分海量图。在中等大小的计算集群上使用真实世界的图形数据集进行的广泛实验证明了我们的方法相对于最新解决方案的优越性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号